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Title of Invention 

HEPATITE C VIRUS ENVELOPE TWO PROTEIN (E2) WICH LACKS ALL 
OR PART OF THE HYPERVARIABLE REGION ONE (HVRI) , 
CORRESPONDING NUCLEIC ACIDS, CHIMERIC VIRUSES AND USES 

THEREOF 

Field of Invention 

The present invention relates to nucleic acid 
molecules that encode a hepatitis C virus (HCV) envelope two 
protein which lacks all or part of the hypervariable region one 
(HVRI) of the envelope two (E2) protein. The invention further 
relates to the use of the nucleic acid molecules and their 
encoded polypeptides as vaccine candidates. 

Background of Invention 

Hepatitis C virus (HCV) is a positive-sense single- 
strand RNA virus belonging to the Flaviviridae family of viruses 
(Rice, 1996). 

At present, more than 80% of individuals infected with 
HCV become chronically infected and these chronically infected 
individuals have a relatively high risk of developing chronic 
hepatitis, liver cirrhosis and hepatocellular carcinoma 
(Hoofnagle, 1997) . The only effective therapy for chronic 
hepatitis C, interferon (IFN), alone or in combination with 
ribavirin, induces a sustained response in less than 50% of 
treated patients (Davis et al., 1998; McHutchinson et al., 
1998) . Consequently, HCV is currently the most common cause of 
end-stage liver failure and the reason for about 30% of liver 
transplants performed in the U.S. (Hoofnagle, 1997). As a 
result of the inability to develop a universally effective 
therapy against HCV infection, it is estimated that there are 
still more than 25,000 new infections yearly in the U.S. (Alter 



WO 01/21807 PCT7US00/25987 

1997) Moreover, since there is no vaccine for HCV, HCV remains 
a serious public health problem. 

A remarkable characteristic of HCV is its genetic 
heterogeneity, which is manifested throughout the genome (Bukh 
et al., 1995). HCV circulates as a quasispecies of closely 
related genomes in an infected individual. Globally, six major 
HCV genotypes (genotypes 1-6) and multiple subtypes (a, b, c, 
etc.) have been identified (Bukh et al . , 1993; Simmonds et al., 
1993) . The nucleotide and deduced amino acid sequences among 
isolates within a quasispecies generally differ by < 2%, whereas 
those between isolates of different genotypes vary by as much as 
35%. 

The most heterogeneous regions of the genome are found 
in the two envelope genes El and E2; in particular, the 
hypervariable region 1 (HVR1) at the N-terminus of E2 (Hijikata 
et al., 1991; Weiner et al., 1991). 

The fact that the HVR1 is the region of the genome 
with the highest degree of genetic variability suggests that it 
is under strong immune pressure. Indeed, Ray et al. (1999) 
recently reported that patients who developed a chronic 
infection had a higher rate of non-synonymous mutations within 
the HVR1 as compared with the El protein while the reverse was 
observed in patients who were able to clear the infection. 
These authors therefore hypothesized that the HVR1 region of HCV 
might act as a decoy antigen by stimulating a strong immune 
response that is ineffective in clearing viremia. 

However, the proposed role of the HVR1 as an 

immunologic decoy is not easily reconciled with prior studies in 

which a hyperimmune rabbit serum raised against the HVR1 was 

demonstrated to be capable of neutralizing HCV in vitro (Farci 

et aJ., 1996, Shimizu et al., 1996). Moreover, the presence of 

amino acids in the carboxy-terminal half of HVR1 which are 
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conserved across genotypes suggests that HVR1 might be required 
for HCV replication. 



Summary of Invention 

The present invention relates to nucleic acid 
molecules which lack all or part of the coding sequence of the 
HVR1 region of the envelope 2 (E2) gene of hepatitis C virus 
(HCV) . 

In the first embodiment, the nucleic acid molecule of 
the invention comprises the genome of an infectious hepatitis C 
virus in which the HVR1 of the envelope 2 gene of the infectious 
HCV has been deleted. 

It is therefore an object of the invention to provide 
nucleic acid sequence which encodes infectious hepatitis C virus 
lacking HVR1. Such nucleic acid sequence is referred to 
throughout the application as "AHVRl-inf ectious nucleic acid 
sequence. " 

In a second embodiment, the nucleic acid molecule 
comprises chimeric genomes of chimeric hepatitis C viruses in 
which the structural region (core and envelope genes) or the 
envelope gene of a pestivirus genome, [for example, a bovine 
viral diarrhea virus (BVDV) ] or a flavivirus genome (for 
example, a dengue virus or a yellow fever virus) are replaced by 
the corresponding structural region or El and E2 genes of an HCV 
in which the HVR1 region of HCV E2 has been removed. It is 
therefore an object of the invention to provide nucleic acid 
sequence which encodes chimeric HCV lacking the HVR1 of the HCV 
E2 gene. Such nucleic acid sequence is referred to throughout 
the application as "AHVRl-chimeric nucleic acid sequences." 

The present invention also relates to the in vitro and 
in vivo production of AHVRl-inf ectious HCV or AHVRl-chimeric HCV 
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viruses from the AHVRl-inf ectious or AHVRl-chimeric nucleic acid 
sequences of the invention. 

The present invention also relates to the use of the 
AHVR1 viruses of the invention to identify cell lines capable of 
supporting the replication of the viruses. 

The invention also relates to the use of the AHVR1 
infectious or chimeric nucleic acid sequences of the invention 
in the production of AHVRl-inf ectious HCV or AHVRl-chimeric HCV 
respectively, and the use of these virions for the development 
of inactivated or attenuated vaccines to prevent HCV in a 
mammal . 

In a third embodiment, the DNA construct comprises an 
HCV E2 gene lacking the HVR1 region. Such a "AHVR1-E2 gene" may 
also be linked in tandem in the DNA construct with an HCV El 
gene. When contained alone or in tandem with the El gene, the 
AHVR1-E2 gene may be further modified at its carboxy-terminus to 
produce either a secreted or surface expressed AHVR1-E2 protein. 

The invention further relates to pharmaceutical 
compositions and DNA-based vaccines which comprise the nucleic 
acid molecules of the invention. 

The invention also relates to methods of preventing or 
treating HCV in a mammal comprising administering the nucleic 
acid molecules of the invention to a mammal in an amount 
effective to stimulate the production of a protective humoral 
and/or cellular immune response to HCV. 

The invention also provides a kit for the treatment or 
prevention of HCV, the kit comprising a DNA molecule of the 
invention useful as an immunogen in generating a protective 
immune response to HCV. 

The invention further relates to the use of the 
nucleic acid molecules of the invention as immunogens to 
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generate antibodies to the AHVRl-inf ectious HCV, the AHVR1- 
chimeric HCV or the AHVR1-E2 protein, preferably neutralizing 
antibodies. The invention therefore relates to the use of such 
antibodies in passive immunoprophylaxis and to pharmaceutical 
compositions which comprise these antibodies. 

The invention also relates to transformation of host 
cells with nucleic acid molecules of the invention to produce 
host cells which express AHVRl-inf ectious HCV, the AHVR1- 
chimeric HCV or the AHVR1-E2 protein. 

The invention further relates to the use of host cells 
expressing AHVRl-inf ectious HCV, the AHVRl-chimeric HCV or the 
AHVR1-E2 protein as immunogens to stimulate a protective immune 
response to HCV. 

The present invention, of course, also relates to 

AHVR1 E2 protein produced from the AHVR1-E2 gene constructs of 
the invention or obtained from the AHVRl-inf ectious HCVs or 
AHVRl-chimeric HCVs of the invention. These AHVR1-E2 proteins 
may be used as vaccines for immunizing mammals, especially 
humans, against HCV. 

Brief Description Of Figures 

Figures 1A-1F show the nucleotide sequence (SEQ ID NO: 
1) of the infectious hepatitis C virus clone of genotype la 
[H77C (AHVR1) ] which lacks the hypervariable region one (HVR1) of 
the second envelope protein and Figures 1G-1H show the amino 
acid sequence (SEQ ID NO: 2) encoded by the clone. The complete 
sequence of H77C(AHVR1) is identical to pCV-H77C but lacks the 
fragment from nucleotide positions 1491 to 1571 which encodes 
HVR1 (Yanagi et al., 1997; ATCC accession number PTA-157). 
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Figure 2 shows in vitro transcription-translation of 
constructs E1E2-715 (Lanes 1, 3, 4, 5, and 6) and AHVR1-7 (Lane 
8) with rabbit reticulocyte lysates with and without the 
addition of canine microsomal membranes. 

Figure 3 shows the results of qualitative reverse 
transcriptase-nested polymerase chain reaction (RT-PCR) for HCV- 
RNA, logio HCV GE (genome equivalent) titer (in-house RT-PCR and 
Amplicor HCV Monitor, Roche Diagnostics), second generation 
ELISA for anti-HCV and serum levels of alanine aminotransferase 
(ALT) in chimpanzee 1590 following transfection with RNA 
transcripts of the HCV deletion mutant H77C(AHVR1). 

Figure 4 shows infection of chimpanzee 96A008 with HCV 
lacking HVR1. Serum samples were collected weekly from the 
chimpanzee and monitored for HCV-RNA [in-house RT-nested PCR and 
HCV Monitor test version 2.0 (Roche)], HCV antibodies (second 
generation ELISA, Abbott Laboratories) and liver enzyme levels 
(ALT, Anilytics) . PBMC were collected weekly and tested for 
HCV-specific proliferative capacity (peripheral CD4) with a 
panel of recombinant HCV proteins [C22 (core), C33-c (NS3) , C100 
(NS3-NS4), and NS5] - The peripheral CD8 + T cell response 
(peripheral CTL) was tested by stimulating PBMC with a large 
panel of HCV peptides corresponding to known CTL epitopes. 
Expanded T cells isolated from liver biopsy samples were tested 
for HCV-specific proliferative responses (Intrahepatic CD4) + , 
positive; - negative. Liver biopsies were examined also for 
necroinf lammatory changes [0 (normal), 1 + , 2+, 3+, 4+]. At week 
0, the chimpanzee was inoculated intravenously with 90 ml of 
plasma from chimpanzee 1590 (week 4 after transfection) . At a 
titer of 10 GE/ml this represented an inoculum of approximately 
900 genome equivalents of HCV. 
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Detailed Description of The Invention 



The present inventors surprisingly observed that HCV 
cDNA from which the HVR1 of the E2 gene had been deleted, 
encoded a hepatitis C virus which was able to replicate in vivo 
and stimulated a strong cellular immune response. Figure 1A-1F 
shows the nucleic acid sequence of this infectious HCV clone 
which was constructed using pCV-H77C (ATCC accession number PTA- 
157), an infectious HCV clone of genotype la. 

The present invention therefore relates in one 
embodiment to a nucleic acid molecule which comprises the genome 
of an infectious hepatitis C virus in which the HVR1 of the 
envelope 2 gene of the infectious HCV has been deleted. 

The E2 protein consists of amino acid 384 to 74 6 of 
the HCV polyprotein and the HVR1 of the E2 protein consists of 
amino acid 384 to 410. 

It is therefore an object of the invention to provide 
nucleic acid sequence which encodes infectious hepatitis C virus 
lacking HVR1 . Such nucleic acid sequence is referred to 
throughout the application as "AHVRl-inf ectious nucleic acid 
sequence . " 

It is understood that the deletion of HVR1 of the E2 
gene in the HCV genome can be made in infectious HCV clones of 
any genotype. For example, infectious HCV clones of different 
genotypes which have been constructed include those of 
Kolykhalov et al., (1997) and Yanagi et al. (1997, 1998) [who 
reported the derivation from HCV strains H77 (genotype la) and 
HC-J4 (genotype lb) of cDNA clones of HCV that are infectious 
for chimpanzees], and Yanagi et al. (Yanagi, 1999) [who reported 
the construction of an infectious cDNA clone from HCV strain HC- 
J6 (genotype 2a) ] . Preferably, the infectious HCV clones are 
infectious HCV clones of genotype la (ATCC accession number PTA- 

7 



WO 01/21807 PCT/US00/25987 

157), lb (ATCC accession number 209596) or 2a (ATCC accession 
number PTA-153) . 

In a second embodiment, the nucleic acid molecule of 
the invention comprises the genome of a flavivirus or pestivirus 
in which one envelope gene of the flavivirus or pestivirus is 
replaced by the E2 gene of an infectious HCV from which the HVR1 
region of HCV E2 has been removed. 

It is therefore an object of the invention to provide 
nucleic acid sequence which encodes chimeric HCV lacking the 
HVR1 of the HCV E2 gene. Such nucleic acid sequence is referred 
to throughout the application as "AHVRl-chimeric nucleic acid 
sequence." 

The Flaviviridae family of viruses which may be used 
to make the AHVRl-chimeric nucleic acid sequences include, but 
are not limited to, dengue virus, bovine viral diarrhea virus, 
yellow fever virus and Kunjin virus.. 

Preferably, the AHVRl-chimeric nucleic acid sequences 
of the invention are made using the structural region or El and 
E2 genes of an infectious HCV clone of any genotype. 
Preferably, the infectious HCV clones are infectious HCV clones 
of genotype la (ATCC accession number PTA-157), lb (ATCC 
accession number 209596) or 2a (ATCC accession number PTA-153) . 

The deletion of HVR1 which may be made in the sequence 
of the invention is at least 5 amino acids in length, preferably 
10 amino acids, and most preferably the entire 27 amino acids of 
HVR1. The nucleic acid molecules of the invention therefore 
comprise genomes of HCV which lack at least a fragment of the 
HVR1 sequence . 

Of course, it is understood that in deleting all or 
part of the HVR1 sequence, one may extend the deletions further 
in the carboxy-terminal direction of the E2 protein as long as 
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the resultant nucleic acid molecule is capable of replicating in 
a chimpazee and producing the E2 protein lacking all or part of 
HVR1. 

The AHVRl-inf ectious nucleic acid sequence of the 
invention may further include one or more mutations such as 
those described in Example 4 which result in amino acid changes. 
One mutation is located within the E2 gene and results in a 
change from leucine to histidine at amino acid position 615. 
Another mutation is located within the NS3 serine-protease 
domain and results in a change from arginine to histidine at 
amino acid position 1143. A third mutation is located within 
the NS5B RNA-polymerase domain and results in a change from 
glutamic acid to aspartic acid at amino acid position 2875. 

The present invention further relates to the in vitro 
and in vivo production of hepatitis C viruses from the AHVRl- 
inf ectious nucleic acid sequences of the invention, and the 
production of chimeric viruses from the AHVRl-chimeric nucleic 
acid sequences of the invention. 

In one embodiment, the AHVRl-inf ectious nucleic acid 
sequences or the AHVRl-chimeric nucleic acid sequences of the 
invention can be inserted into an expression vector that 
functions in eukaryotic cells. Eukaryotic expression vectors 
suitable for producing high efficiency gene transfer in vivo are 
well known to those of ordinary skill in the art and include, 
but are not limited to, plasmids, vaccinia viruses, 
retroviruses, adenoviruses and adeno-associated viruses. 

The sequences contained in the recombinant expression 
vector can then be transcribed in vitro by methods known to 
those of ordinary skill in the art in order to produce RNA 
transcripts which encode the HCV or chimeric viruses of the 
invention. The HCV or chimeric viruses of the invention may 
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then be produced by transfecting cells by methods known to those 
of ordinary skill in the art with either the in vitro 
transcription mixture containing the RNA transcripts or with the 
recombinant expression vectors containing the nucleic acid 
sequences described herein. 

Where transfection of cells with recombinant 
expression vectors containing the nucleic acid sequences of the 
invention is used, transfection may be done by methods known in 
the art such as electroporation, precipitation with DEAE-Dextran 
or calcium phosphate, or incorporation into liposomes. 

Suitable cells or cell lines for culturing the HCV or 
chimeric viruses of the invention include, but are not limited 
to, EBTr and Huh7 . 

The present invention also relates to the use of the 
AHVRl-inf ectious nucleic acid sequences or the AHVRl-chimeric 
nucleic acid sequences of the invention to identify cell lines 
capable of supporting the replication of HCV and the chimeric 
viruses of the invention. 

The invention further relates to the use of the AHVRl- 
inf ectious nucleic acid sequences or the AHVRl-chimeric nucleic 
acid sequences of the invention to develop inactivated or 
attenuated vaccines to prevent hepatitis C in a mammal. For 
example, virions from cell lines infected with the HCV or 
chimeric viruses of the invention, or transfected with a AHVRl- 
inf ectious nucleic acid sequence or a AHVRl-chimeric nucleic 
acid sequence of the invention, can be purified from the cells 
and inactivated by methods known to those of ordinary skill in 
the art. The inactivated virions can be used to immunize mice, 
and if neutralizing antibody to HCV is produced, the virions can 
then be used to immunize chimpanzees to determine whether the 
antibodies are protective. Alternatively, cells infected with 
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the viruses of the invention may be passaged in cell culture to 
produce attenuated viruses which can be tested as candidate live 
vaccines. In assaying the ability of the viruses of the 
invention to infect mammals one can assay sera or liver of the 
infected mammal by RT-PCR to determine viral titer. In 
addition, the virulence phenotype of the virus produced by 
transfection of mammals with the sequences of the invention can 
be monitored by methods known in the art such as measurement of 
liver enzyme levels (alanine aminotransferase (ALT) or 
isocitrate dehydrogenase (ICD)) or by histopathology of liver 
biopsies . 

When used as a vaccine, the HCV or chimeric virions 
can be administered alone or in a suitable diluent, including, 
but not limited to, water, saline, or some type of buffered 
medium. The vaccine according to the present invention may be 
administered to an animal, especially a mammal, and most 
especially a human, by a variety of routes, including, but not 
limited to, intradermally, intramuscularly, subcutaneously, or 
in any combination thereof. Of course, it is understood that 
formulations or compositions comprising the HCV or chimeric 
virions of the invention may be used either therapeutically or 
prophylactically to treat or prevent the signs and symptoms of 
Hepatitis C. 

In a third embodiment, the nucleic acid molecule of 
the invention comprises an HCV E2 gene lacking the HVR1 region. 
Such a "AHVR1-E2 gene" may also be linked in tandem in the DNA 
construct with an HCV El gene. 

When contained alone or in tandem with the El gene, 
the AHVR1-E2 gene may be further modified at its carboxy- 
terminus to produce either a secreted or surface expressed 
AHVR1-E2 protein. 
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A AHVR1-E2 gene encoding E2 protein targeted to the 
cell surface is preferred. Such a construct may be constructed 
by fusing an endoplasmic reticulum signal sequence to the amino- 
terminus of the nucleic acid sequence which encodes the 
truncated E2 gene fused at its carboxy-terminus to a plasma 
membrane anchor sequence. 

By endoplasmic reticulum (ER) signal sequence is meant 
a nucleic acid sequence which encodes a continuous stretch of 
amino acids, typically about 15 to about 25 residues in length, 
which are known in the art to be generally located at the amino 
terminus of proteins and are capable of targeting proteins to 
the endoplasmic reticulum. Such ER signal sequences are known 
to those of skill in the art (see, for example, van Heijne, G. 
J. Mol. Biol. , (1985) 184:99-105) and those of skill in the art 
would understand that even though their amino acid sequences may 
vary, such ER signal sequences are functionally interchangeable. 
Examples of ER signal sequences which may be used in the 
chimeric genes of the invention include, but are not limited to, 
the 20-carboxy-terminal amino acids of the full-length HCV El 
protein (amino acids 364-383 of the HCV polyprotein) , which 
serves as the natural signal sequence of the E2 protein or the 
murine Ig kappa-chain V-J2-C signal peptide sequence contained 
in the pDisplay vector. 

Where the truncated HCV envelope protein is a 
truncated E2 protein, the approximately 30 carboxy-terminal 
amino acids of E2 have been identified to contain an ER 
retention sequence and its removal and replacement with a plasma 
membrane anchor sequence is believed to be critical for 
expression of the truncated E2 protein on the cell surface. 
Thus, the truncated E2 protein contains a truncation of at least 
the 20 carboxy-terminal amino acids of the full-length E2 
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protein, more preferably, a truncation of at least the 25 
carboxy-terminal amino acids, and most preferably, a truncation 
of at least about the 30 carboxy-terminal amino acids. By 
"plasma membrane anchor sequence" as used in the chimeric gene 
of the invention is meant a nucleic acid sequence which encodes 
an amino acid sequence that allows for retention of at least 
part of the protein in the plasma membrane of a cell. At a 
minimum, a plasma membrane anchor sequence encodes a sequence of 
hydrophobic amino acids of sufficient length to span the lipid 
bilayer of the plasma membrane. Such hydrophobic sequences are 
known in the art as transmembrane domains and are typically 
found at the carboxy-terminus of many proteins found on the 
surface of cells or virions. These transmembrane domains are 
typically at least 20 to 30 amino acids in length and are 
followed by charged cytoplasmic domains of varying lengths. 

It is therefore understood that the plasma membrane 
anchor sequence encoded by the coding sequence of the invention 
may contain in addition to a transmembrane domain of a virion or 
a protein found on the surface of a cell, a cytoplasmic domain. 

Perferably, the encoded plasma membrane anchor 
sequence is at least twenty amino acids in length, more 
preferably, from about 20 to about 100 amino acids in length, 
and most preferably, from about 30 to about 70 amino acids in 
length. Examples of plasma membrane anchor sequences include, 
but are not limited to, hydrophobic transmembrane domains of 
receptors such as those for insulin and for a number of growth 
factors including platelet-derived growth factor (PDGF) and 
epidermal growth factor (EGF) , as well as the transmembrane 
domains of viral proteins that are anchored in the lipid 
envelope of the intact virion such as the transmembrane domains 
of the vesicular stomatitis and rabies virus G proteins. 



13 



WO 01/21807 PCT/US00/25987 

Preferred plasma membrane anchor sequences for 
inclusion in the chimeric genes of the invention are sequences 
which encode the 50 amino acid transmembrane domain of the PDGF 
receptor as contained in the pDisplay vector described in the 
Examples, the - carboxy-terminal 64 and 37 amino acids 
respectively of the CD4 and decay accelerating factor (DAF) 
proteins (these sequences constitute the transmembrane and 
cytoplasmic domains of the CD4 and DAF proteins) and the 4 9 
carboxy-terminal amino acids of the VSV G protein (also 
constituting the transmembrane and cytoplasmic domains of the 
VSV G protein) . 

Of course, one of ordinary skill in the art would 
readily understand that other transmembrane domains suitable 
for use as plasma membrane anchor sequences in the chimeric 
genes of the invention are known or could be readily identified 
by carrying out carboxy-terminal deletions of known plasma 
membrane or viral envelope proteins (see, for example, Men et al 
(J. Virol. (1991) 65; 1400-1407). 

A AHVR1-E2 gene encoding a secreted E2 protein may be 
constructed by fusing an endoplasmic reticulum signal sequence 
to the nucleic acid sequence which encodes the truncated E2 
gene. In this case, the truncated E2 gene contains a truncation 
of at least the 31 carboxy-terminal amino acids (amino acids 
716-746), and more preferably, a truncation of at least the 85 
carboxy-terminal amino acids (amino acids 662-746) . 

Like the AHVRl-inf ectious nucleic acid sequences and 
the AHVRl-chimeric nucleic acid sequences described above, it is 
understood that the AHVR1-E2 nucleic acid sequences of the 
invention may lack all or part of the HVR1 sequence. The 
deletion of HVR1 is at least 5 amino acids in length, preferably 
10 amino acids, and most preferably the entire 27 amino acids of 
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HVR1. The AHVR1-E2 nucleic acid sequences of the invention may 
also contain further truncations at the carboxy-terminus of the 
E2 gene of HCV, and/or mutations at amino acid position 615 as 
described above. 

The present invention therefore relates to insertion 

of the nucleic acid molecules comprising the AHVR1-E2 gene of 
the invention into a suitable expression vector that functions 
in eukaryotic cells, preferably in mammalian cells. By suitable 
it is meant that the vector is capable of carrying and 
expressing a chimeric gene of the invention. The expression 
vector therefore contains at least one promoter and any other 
sequences necessary or preferred for appropriate transcription 
and translation of the AHVR1-E2 gene. Preferred expression 
vectors include, but are not limited to, plasmid vectors. 

The invention also relates to the use of expression 
vectors containing the AHVR1-E2 nucleic acid molecules of the 
invention as immunogens to produce protective antibodies to HCV. 
Direct transfer of the AHVR1-E2 nucleic acid sequences of the 
invention to a mammal, preferably a primate, more preferably a 
human, may be accomplished by injection by needle or by use of 
other DNA delivery devices such as the gene gun. Possible 
routes of administration of the expression vector include, but 
are not limited to, intravenous, intramuscular, intradermal, 
subcutaneous, intraperitoneal and intranasal. 

Since the existence of different genotypes with a low 
degree of homology within the envelope proteins diminishes the 
hope of identifying conserved neutralization epitopes (Bukh J, 
et al, Sem Liver Pis (1995); 15:41-63), it is likely that a 
polyvalent vaccine will be needed to generate broadly reactive 
neutralizing antibodies. Thus, in a preferred embodiment, 
nucleic acid molecules comprising AHVR1-E2 nucleic acid 
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sequences of isolates from multiple genotypes of HCV may be 
administered together to provide protection against challenge 
with multiple genotypes of HCV. 

Accordingly, those of ordinary skill in the art would 
readily understand that multiple copies of different AHVR1-E2 
nucleic acid sequences may be inserted into a single vector such 
that a host cell transformed or transfected with the vector will 
produce multiple envelope proteins. For example, a 
polycistronic vector in which multiple different AHVR1-E2 genes 
may be expressed from a single vector is created by placing 
expression of each gene under control of an internal ribosomal 
entry site (IRES) (Molla, a. et al. Nature, 356:255-257 (1992); 
Gong, S.K. et al. J. of Virol. , 263:1651-1660 (1989)). In a 
preferred embodiment, copies of different AHVR1-E2 nucleic acid 
sequences are inserted into multiple vectors and transformed or 
transfected into host cells so that multiple envelope proteins 
can be produced. 

The expression vectors containing the AHVR1-E2 nucleic 
acid sequences of the invention may be supplied in the form of a 
kit, alone, or in the form of a pharmaceutical composition. 

Suitable amounts of material to administer for 
prophylactic and therapeutic purposes will vary depending on the 
route selected and the immunogen (E2 gene with different 
deletions) administered. One skilled in the art will appreciate 
that the amounts to be administered for any particular treatment 
protocol can be readily determined without undue 
experimentation. The vaccines of the present invention may be 
administered once or periodically until a suitable titer of 
anti-HCV antibodies appear in the blood. A suitable amount of 
expression vector to be used for prophylactic purposes might be 
expected to fall in the range of from about 1 |ig to about 5 mg, 
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more preferably from about 100 ^g to about 5 mg, and most 
preferably from about 1 mg to about 2 mg. Such administration 
will, of course, occur prior to any sign of HCV infection. 
Further, one of skill in the art will readily understand that 
the amount of vector to be used will depend on the size and 
species of animal the vector is to be administered to. 

A vaccine of the present invention may be employed in 
sterile liquid forms such as solutions or suspensions. Any 
inert carrier is preferably used, such as saline or phosphate- 
buffered saline, or any such carrier in which the expression 
vector of the present invention can be suitably suspended. The 
vaccines may be in the form of single dose preparations or in 
multi-dose flasks which can be utilized for mass-vaccination 
programs of both animals and humans. Of course, specific 
adjuvants such as CpG motifs (Krieg, A.K. et al. (1995) Nature 
374:546 and Krieg et al. (1996)) J. Lab. Clin. Med. , 128:128) 
may prove useful with DNA-based vaccines or other vaccines. 

The DNA-based vaccines will normally exist as 
physically discrete units suitable as a unitary dosage for 
animals, especially mammals, and most especially humans, wherein 
each unit will contain a predetermined quantity of active 
material calculated to produce the desired immunogenic effect in 
association with the required diluent. The dose of said vaccine 
or inoculum according to the present invention is administered 
at least once. In order to increase the antibody level, a 
second or booster dose may be administered at some time after 
the initial dose. The need for, and timing of, such booster 
dose will, of course, be determined within the sound judgment of 
the administrator of such vaccine or inoculum and according to 
sound principles well known in the art. For example, such 
booster dose could reasonably be expected to be advantageous at 
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some time between about 2 weeks to about 6 months following the 
initial vaccination. Subsequent doses may be administered as 
indicated. 

The AHVR1-E2 nucleic acid sequences of the present 
invention can also be administered for purposes of therapy, 
where a mammal, especially a primate, and most especially a 
human, is already infected, as shown by well-known diagnostic 
measures. When expression vectors containing the chimeric genes 
of the present invention are used for such therapeutic purposes, 
much of the same criteria will apply as when they are used as a 
vaccine, except that inoculation will occur post-infection. 
Thus, when the expression vectors of the present invention are 
used as therapeutic agents in the treatment of infection, the 
therapeutic agent comprises a pharmaceutical composition 
containing a sufficient amount of the expression vector so as to 
elicit a therapeutically effective response in the organism to 
be treated. Of course, the amount of pharmaceutical composition 
to be administered will, as for vaccines, vary depending on the 
immunogen contained therein and on the route of administration. 

The therapeutic agent according to the present 
invention can thus be administered by subcutaneous, 
intramuscular, intradermal or intranasal routes. One skilled in 
the art will certainly appreciate that the amounts to be 
administered for any particular treatment protocol can be 
readily determined without undue experimentation. Of course, 
the actual amounts will vary depending on the route of 
administration as well as the sex, age, and clinical status of 
the subject which, in the case of human patients, is to be 
determined with the sound judgment of the clinician. 

The therapeutic agent of the present invention can be 
employed in sterile liquid forms such as solutions or 
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suspensions. An inert carrier is preferably used, such as 
saline, phosphate-buffered saline, or any such carrier in which 
the expression vectors of the present invention can be suitably 
suspended. The therapeutic agents may be in the form of single 
dose preparations or in multi-dose flasks, which can be utilized 
for mass-treatment programs of both animals and humans. Of 
course, when the expression vectors of the present invention are 
used as therapeutic agents, they may be administered as a single 
dose or as a series of doses, depending on the situation as 
determined by the person conducting the treatment. 

The invention also relates to transformation of host 
cells with nucleic acid molecules of the invention to produce 
host cells which express HVRl-inf ectious HCV, the AHVRl-chimeric 
HCV or the AHVR1-E2 protein. 

The invention further relates to the use of host cells 
expressing AHVRl-inf ectious HCV, the AHVRl-chimeric HCV or the 
AHVR1-E2 protein as immunogens to stimulate a protective immune 
response to HCV. 

The present invention, of course, also relates to 
AHVR1 E2 protein produced from the AHVR1-E2 gene constructs of 
the invention or obtained from the AHVRl-inf ectious HCVs or 
AHVRl-chimeric HCVs of the invention. 

The nucleic acids, polypeptides and viruses of the 
present invention can also be utilized in the production of 
antibodies against HCV. The term "antibody" is herein used to 
refer to immunoglobulin molecules and immunologically active 
portions of immunoglobulin molecules. Examples of antibody 
molecules are intact immunoglobulin molecules, substantially 
intact immunoglobulin molecules and portions of an 
immunoglobulin molecule, including those portions known in the 
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art as Fab, F(ab') 2 and F(v) as well as chimeric antibody 
molecules . 

Thus, the polypeptides, viruses and nucleic acid 
sequences of the present invention can be used in the generation 
of antibodies that immunoreact (i.e., specific binding between 
an antigenic determinant-containing molecule and a molecule 
containing an antibody combining site such as a whole antibody 
molecule or an active portion thereof) with antigenic 
determinants on the surface of hepatitis C virus particles. 

The present invention therefore also relates to 
antibodies produced following immunization with the nucleic acid 
sequences, viruses or polypeptides of the present invention. 
These antibodies are typically produced by immunizing a mammal 
with an immunogen or vaccine to induce antibody molecules having 
immunospecif icity for polypeptides or viruses produced in 
response to infection with the nucleic acid sequences of the 
present invention. When used in generating such antibodies, the 
nucleic acid sequences, viruses, or polypeptides of the present 
invention may be linked to some type of carrier molecule. The 
resulting antibody molecules are then collected from said 
mammal. Antibodies produced according to the present invention 
have the unique advantage of being generated in response to 
authentic, functional polypeptides produced according to the 
actual cloned HCV genome. 

The antibody molecules of the present invention may be 
polyclonal or monoclonal. Monoclonal antibodies are readily 
produced by methods well known in the art. Portions of 
immunoglobin molecules, such as Fabs, as well as chimeric 
antibodies, may also be produced by methods well known to those 
of ordinary skill in the art of generating such antibodies. 
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The antibodies according to the present invention may 
also be contained in blood, plasma, serum, hybridoma 
supernatants, and the like. Alternatively, the antibody of the 
present invention is isolated to the extent desired by well 
known techniques such as, for example, using DEAE Sephadex. The 
antibodies produced according to the present invention may be 
further purified so as to obtain specific classes or subclasses 
of antibody such as IgM, IgG, IgA, and the like. Antibodies of 
the IgG class are preferred for purposes of passive protection. 

The antibodies of the present invention are useful in 
the prevention and treatment of diseases caused by hepatitis C 
virus in animals, especially mammals, and most especially 
humans . 

In providing the antibodies of the present invention 
to a recipient mammal, preferably a human, the dosage of 
administered antibodies will vary depending on such factors as 
the mammal's age, weight, height, sex, general medical 
condition, previous medical history, and the like. 

In general, it will be advantageous to provide the 
recipient mammal with a dosage of antibodies in the range of 
from about 1 mg/kg body weight to about 10 mg/kg body weight of 
the mammal, although a lower or higher dose may be administered 
if found desirable. Such antibodies will normally be 
administered by intravenous or intramuscular route as an 
inoculum. The antibodies of the present invention are intended 
to be provided to the recipient subject in an amount sufficient 
to prevent, lessen or attenuate the severity, extent or duration 
of any existing infection. 

The invention also provides that the nucleic acid 
molecules, viruses, polypeptides and antibodies of the invention 
may be supplied in the form of a kit, alone or in the form of a 
pharmaceutical composition. 
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All scientific publication and/or patents cited herein 
are specifically incorporated by reference. The following 
examples illustrate various aspects of the invention but are in 
no way intended to limit the scope thereof. 

EXAMPLES 
Materials and Methods 

Construction of An Expression Vector AHVR1-7 Containing El and a 
Carboxy-Terminal Truncated E2 Which Lacks the HVR1 Region and 
the Carboxy-Terminal 31 Amino Acids of E2 

Deletion of the nucleotide sequence encoding the HVR1 
region was performed by fusion PCR using pCV-H77C, the 
infectious cDNA clone of HCV genotype la (Yanagi et al., 
1997;ATCC accession number PTA-157) . 



22 



WO 01/21807 PCT/US00/2S987 

Table 1 

Primers used for the construction of AHVR1* 



PRIMER SEQUENCE 
SEQ: ID NO: 3 

E2-Pstl: 5'-ACG CGT CTG CAG CTT AAT GGC CCA GGA CGC GAT GCT TG-3' 
SEQ: ID N0:4 

El-Bglll: 5'- ACG CGT AGA TCT TAC CAA GTG CGC AAT TCC TCG GGG-3' 
SEQ: ID NO: 5 

E1-383R (+10): 5' -TCA GTT GGA TAG CGT CGA CGC CGG CAA ATA GCA G-3' 
SEQ: ID NO: 6 

E2-411S (+10): 5'-CGT CGA CGC GAT CCA ACT GAT CAA CAC CAA CGG C-3' 
SEQ: ID N0:7 

HVRl-mutS: 5' -CTT GTA CCA TCA ATT ACA CCA TAT TC-3' 
SEQ: ID NO: 8 

HVRl-mutR: 5' -GAT AGT GCC AAT GCC TAT ACG GG-3' 
SEQ: ID NO: 9 

FUSION 1: 5' -CGT ATA GGC ATT GGC ACT ATC CTT GTA CCA TCA ATT ACA CC-3' 
SEQ: ID NO: 10 

FUSION2 : 5' -GGT GTA ATT GAT GGT ACA AGG ATA GTG CCA ATG CCT ATA CG-3' 
SEQ: ID NO: 11 

E2-364 NotI: 5'-TTT TTT TT G CGG CCG C AT GGT GGG GAA CTG GGC GAA GGT 

CC-3' 

SEQ: ID NO: 12 

E2-661-HindIII: 5' -ACG CGT AAG CTT CTA TTA CTC GGA CCT GTC CCT GTC TTC 
CAG- 3' 
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*Restriction sites within primers are underlined 

In the first step, the sequences encoding the El 
protein (aa 192-383) and a truncated E2 protein (aa 411-715) 
lacking the HVR1 region (aa 384-410) were PCR amplified from 
pCV-H77C. Briefly, 50 ng of cDNA were added to a master mix 
containing 4 pi of 5X Advantage KlenTaq Buffer, 1.25 pi of dNTP 
(10 mM), 1 pi of 10 pM sense primer [El-Bglll (SEQ ID NO: 4) for 
El and E2-411S(+10) (SEQ ID NO: 6) for E2], 1 pi of lOpM 
antisense primer [E1383R (+10) for El and E2-Pstl for E2] and 1 
pi of the Advantage KlenTaq polymerase mixture. Cycling 
conditions were 99°C for 1 min followed by 25 cycles of 99°C for 
35 sec, 67°C for 30 sec, 68°C for 3 min 30 sec. After 
purification of the PCR products, a fusion PCR was performed. 
Briefly, 2.5 pi of each PCR product were added to a master mix 
containing 10 pi of 10X Pfu buffer, 2 pi of 10 mM dNTP, 5 pi of 
10 pM sense primer (El-Bglll) (SEQ ID NO: 4), 5 pi of 10 pM 
antisense primer (E2-Pstl) (SEQ ID NO: 3), 1 pi of Pfu and 72 pi 
of H 2 0. Cycling conditions were 95 °C for 1 min, 67 °C for 1 
min, 72 °C for 3 min 30 sec (30 cycles). The fusion PCR product 
was digested with Eglll and PstI and cloned into the expression 
vector pDisplay (Invitrogen) . A clone with an insert of the 
correct size (AHVR1-7) was selected and sequenced. Sequence 
analysis of both strands of DNA confirmed that this clone 
contained the expected sequence of El and a truncated E2 lacking 
the HVR1, i.e., AHVR1-7 encodes amino acids 192-383 of the El 
protein fused to amino acids 411-715 of the E2 protein. 

Construction of An Infectious clone of HCV Lacking HVR1 

Construction of an infectious cDNA clone that was 
full-length except for the HVR1 was performed by digestion of 
the expression vector AHVR1-7 with Muni (which cuts at 
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nucleotide positions 1254 and 1983 of pCV-H77C) and cloning of 
the resulting fragment into the digested HCV cDNA clone H77C. 
Briefly/ a 3 pg sample of the infectious HCV cDNA clone H77C was 
digested with Muni for 3 hours and the enzyme was then 
inactivated at 65 °C for 20 min. The digested cDNA clone, 
lacking a portion of the El and E2 regions (nucleotides 1255 to 
1983 of pCV-H77C) , was dephosphorylated with calf intestinal 
alkaline phosphatase. A sample of AHVR1-7 was then digested 
with Muni for 3 hours. After inactivation of the Muni enzyme, 
the digested fragment was ligated into the digested pCV-H77C 
using standard procedures (Forns et al. f 1999). A clone 
containing the correct insert was selected, retransf ormed and 
large-scale plasmid DNA was prepared as previously described 
(Yanagi et al., 1997). The complete sequence of the HVR1 
deletion mutant [H77C (AHVR1) ] was the expected one, that is, 
identical to pCV-H77C but lacking the fragment from nucleotide 
positions 1491 to 1571 which encodes HVR1. 

In vitro transcription-translation analysis 

Non-linearized plasmid AHVR1-7 and the control plasmid 

E1E2-715 were used for in vitro transcription-translation. The 

AHVR1-7 plasmid contained the nucleotide sequence encoding El 

(aa 192-383) and E2 (aa 411-715) inserted between a leader 

sequence which targeted the HCV proteins to the secretory 

pathway and the transmembrane domain of the platelet-derived 

growth factor receptor (PDGFR) which anchored the HCV E2 

proteins to the plasma membrane in the expression vector 

pDisplay (Invitrogen) . The control plasmid contained the 

nucleotide sequence encoding El (aa 192-383) a carboxy-truncated 

form of E2 (aa 384-715) of pCV-H77C inserted between a leader 

sequence which targeted the HCV proteins to the secretory 

pathway and the transmembrane domain of PDGFR which anchored the 
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HCV E2 proteins to the plasma membrane in the expression vector 
pDisplay. Reactions were performed in 25 pi of the TNT Coupled 
Reticulocyte Lysate System (Promega) containing [S 35 ] methionine, 
with or without the addition of canine microsomal membranes at 
30 °C for 90 min. Total translation products were separated in 
12% SDS/PAGE and identified by autoradiography. 

Analysis of cell surface-expression of the E2 protein 

Huh7 cells grown in 4-well tissue culture chambers 
were transfected with plasmids E1E2-715 and AHVR1-7 described 
above. Immuno-f luorescence analysis was performed 48 hours 
after transf ection. Live cells were incubated for 30 minutes 
with a 1:100 dilution of rabbit hyperimmune serum (Lmf86) raised 
against a peptide encoding the carboxy-terminal 21 amino acids 
of HVR1 (aa 390-410 of pCV-H77C) or a rabbit hyperimmune serum 
(FOR-1) raised against a peptide within E2, but outside the HVR1 
(aa 517-535 of pCV-H77C) . After washing, cells were incubated 
with FITC-conjugated goat anti-rabbit IgG (1:100) for 30 min at 
37°C. After washing, slides were mounted and examined for 
immunofluorescence. For intracellular immunofluorescence 
staining , cells were fixed and permeabilized with cold acetone, 
as previously described (Forns et al. r 1999). 

Transf ection of a chimpanzee with H77C(AHVR1) RNA 

RNA was transcribed in vitro with T7 RNA polymerase 
from 10 pg of template plasmid H77C(AHVR1) linearized with Xbal 
as described previously (Yanagi et al., 1997). The quality and 
amount of RNA were analyzed by gel electrophoresis and ethidium 
bromide staining. Two transcription mixtures were each diluted 
with 400 pi of ice-cold phosphate-buffered saline without 
calcium or magnesium and then immediately frozen on dry ice and 
stored at -80°C. Within 24 hours, both transcription mixtures 
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were percutaneously injected into the liver of a chimpanzee 
(number 1590) under ultrasonographic control- The housing, 
maintenance, and care of the animals were in compliance with all 
relevant guidelines and requirements. 

Serum samples were collected weekly from the 
chimpanzee and monitored for serum levels of alanine 
aminotransferase (ALT), anti-HCV antibodies [second generation 
ELISA] and HCV-RNA [HCV Monitor test (Roche) and in-house RT- 
nested PCR (Bukh et al., 1998)]. Sequence analysis of the 
recovered virus was performed at different time points during 
follow-up. In short, genomic regions were amplified in RT- 
nested PCR (Bukh et al., 1998) with primers specific for the H77 
strain of HCV. 

Construction of vectors that Express Mutant E2 Protein 
on the Cell Surface or in a Form That Is Secreted 

A. Cell Surface AHVR1-E2 Vectors 

Clones E1E2-715 (amino acids 192-715) and AHVR1-7 
(amino acids 192-383 of El in frame with amino acids 411-715 of 
E2) were produced as described above. 

Clone AHVRl-mut5 (amino acids 192-383 of El in frame 
with amino acids 411-715 of E2 except that amino acid 615 is 
changed from a leucine to a histidine was produced as follows. 

Primers El-Bglll (SEQ ID NO: 4) and HVRl-mutR (SEQ ID 
NO: 8) (containing a T->A mutation at nucleotide position 2185: 
see Example 4) were used to amplify a fragment encoding the El 
protein and a portion of the E2 protein using Advantage KlenTaq 
Polymerase mix, as described above. Primers HVR-1 mutS (SEQ ID 
NO: 7) and E2-Pstl (SEQ ID NO: 3) were used to amplify the rest 
of the truncated E2 protein by using AmpliTaq Gold DNA 
polymerase. In a second step, both products were fused by an 
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amplification reaction that contained a mixture of both external 
primers (El-BglH (SEQ ID NO: 4) and E2-Pstl (SEQ ID NO: 3) and 
the two fusion primers SEQ ID NO: 9 and SEQ ID NO: 10) (Table 1) , 
by using Advantage KlenTaq Polymerase mix. The fusion product 
was digested with Bglll and PstI and cloned into the digested 
pDisplay vector to produce AHVRl-mut5. 

Clones E1E2-715, AHVR1-7 and AHVRl-mut5 were used to 
transfect Huh7 cells and to determine their pattern of 
reactivity against a panel of 5 rabbit hyperimmune sera and 12 
human monoclonal antibodies (Cardoso et al., 1998, Inchauspe et 
al. 1998) by immunofluorescence in live and f ixed/permeabilized 
cells _ 

B. Secreted AHVR1-E2 Vectors 

In addition, three different expression vectors 
encoding a secreted form of the E2 protein were constructed: E2- 
661, encoding the ER signal sequence of El (aa 364-383 of pCV- 
H77C) and a carboxy-terminal truncated E2 protein (aa 384-661 of 
pCV-H77C); E2-661 AHVR1, encoding the same protein with a 
deletion of the HVR1 region (aa 384-410 of pCV-H77C) ; and E2- 
661 AHVRl-mut, encoding the same protein with a deletion of the 
HVR1 and the replacement of leucine for histidine at amino acid 
position 615. In this case, amplification was performed from 
templates E1E2-715, AHVR1-7 and AHVRl-mut5, respectively, with 
primers shown in Table 1. PCR products were cloned into the 
expression vector pcDNA3.1(-) (Invitrogen) . Clones E2-661, E2- 
661 AHVR1, and E2-661 AHVRl-mut were used to express soluble E2 
protein in vitro. Briefly, one microgram of each of the 
plasmids was used for in vitro transcription-translation. 
Reactions were performed in 25 pi of the TNT Coupled 
Reticulocyte Lysate System (Promega) containing [S 35 ] methionine, 
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with the addition of canine microsomal membranes at 30°C for 90 
min. 



Example 1 

In vitro transcription-translation analysis 

To determine whether cleavage between El and E2 was 
affected by the deletion of the HVR1 region, expression vectors 
E1E2-715 (encoding the complete El protein and a cell-surface E2 
protein lacking the carboxy-terminal 31 amino acids of E2) and 
AHVR1-7 (encoding the complete El protein and a cell-surface E2 
lacking the HVR1 and carboxy-terminus 31 amino acids) were 
assayed for in vitro protein synthesis. In vitro transcription- 
translation was performed with the TNT Coupled Reticulolysate 
System using [S 35 ] methionine . In the absence of microsomal 
membranes, translation products of the expected size were 
obtained for E1E2-715 and AHVR1-7 (Figure 2) . When microsomal 
membranes were added, cleavage between El and E2 was observed in 
both cases (Figure 2) and f as expected, the E2 protein derived 
from AHVR1-7 was slightly smaller in size. These results 
indicate that cleavage between El and E2 is not affected by the 
deletion of the N-terminus (HVR1) of the E2 protein. 

Example 2 

Analysis of cell-surface expression of the E2 protein 

To analyze whether a truncated E2 protein lacking the 
HVR1 region could be targeted to the cell surface, Huh7 cells 
were transfected with E1E2-715 and AHVR1-7 . After transf ection, 
f ixed/permeabilized cells and live cells were analyzed for 
expression of E2 by immunofluorescence. Up to 20% of cells 
transfected with E1E2-715 were positive when stained with a 

rabbit hyperimmune serum Imf 86 or FOR-1. Both fixed 
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/permeabilized and live cells were stained. A similar 
proportion of cells transfected with AHVR1-7 was positive when 
stained with rabbit hyperimmune serum FOR-1 (raised against aa 
517-535) . Again, both f ixed/permeabilized and live cells were 
stained. In contrast, there was no staining with rabbit 
hyperimmune serum lmf86 r which was raised against aa 390-410 
(within HVR1) . These results demonstrated that transport of the 
E2 protein to the cell surface was not affected by deletion of 
the HVR1 region, suggesting that deletion of the N-terminal 
domain of E2 did not result in misfolding of this protein. 

Example 3 

Transfection of a chimpanzee with RNA of H77C(AHVR1) 

To determine the infectivity of H77C(AHVR1), RNA 
transcripts were injected into the liver of chimpanzee 1590. 
HCV-RNA was detected by RT-nested PCR at week one post- 
inoculation (p.i.) and the animal remained HCV-RNA positive 
throughout the entire follow-up period of 22 weeks (Figure 3) . 
The quantitative HCV Monitor test was negative until week 10 
p.i. The genome titer during the first 7 weeks was □ 10 genome 
equivalent (GE)/ml. However, beginning at week 8, the titer 
progressively increased over time {Table 2 and Figure 3) . 
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Table 2 



Chimpanzee 1590 Infected with H77C(AHVRl) 
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t loglO HCV GE titer by in-house RT-PCR 

* Amplicor HCV monitor (number of genome copies/ml) (Roche Diagnostics) 

Nucleotide sequence corresponding to Yanagi 1997 
Note: Lower case letters represent minor species 



Serum liver enzyme values remained normal during the 
follow-up. The second generation anti-HCV test remained 
negative at least through week 22 p.i. 

The follow-up period for chimpanzee 1590 has been 
extended to 72 weeks. Chimpanzee 1590 remained HCV-RNA positive 
throughout the 72 week follow-up period. Thus, the chimpanzee 
developed a chronic infection. In addition, the genome titer 
reached 10 3 - 10 4 GE/ml at weeks 11-14 (see Table 2) and remained 
at 10 3 - 10 4 GE/ml throughout the remainder of the 72 week 
period. 

At weeks 51 and 52 p.i., the relative genome titer of 
HCV lacking HVR1 was also assessed in serum, PMBC, and liver 
obtained from the transfected chimpanzee. HCV RNA was not 
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detected in PBMCs, but the HCV titers determined from liver 
tissue at weeks 51 and 52 p.i. were higher than the titers 
determined from the serum. 

The 72 week follow-up period also revealed that the 
transfected chimpanzee became positive for antibodies to second 
generation ELISA HCV at week 37 and third generation RIBA 
(Chiron) confirmed the presence of antibodies to C22(core), C33- 
c (NS3), and clOO (NS3-NS-4). 

Finally, CD4+ and CD8+ T cell responses were also 
measured. An HCV-specific proliferative CD4 + T cell response to 
C22 (core), C33-C (NS3), clOO (NS3-NS4 ) and/or NS5 antigens of 
HCV was detected in the peripheral blood mononuclear cells 
(PBMC) beginning at week one post-inf ection and throughout the 
follow-up period. During most weeks, however, only a mono- 
specific response to the core was detected. 

Peripheral CD8+ T cell (CTL) responses were tested at 
weeks 5, 9, 14 , and 18 by an in vitro peptide stimulation assay. 
CTL were detected with two NS4 peptides at weeks 9 and 14 and 
with one of these at week 18. A transient and monospecific 
(NS5) proliferative T cell response was detected in the liver at 
weeks 14 and 15. Although serum liver enzyme values remained 
normal during the entire follow-up, necroinf lammatory changes 
indicative of hepatitis were detected in liver biopsies during 
weeks 34-40. 

Example 4 

Analysis of the HCV-RNA recovered from chimpanzee 1590 

Sequence analysis of the HCV-RNA recovered from 
chimpanzee 1590 was performed at week 13. The entire ORF was 
amplified by RT-nested PCR of overlapping gene regions. Direct 
sequence analysis of both strands of DNA from the PCR products 
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confirmed that the recovered virus lacked the HVR1 region. 
Interestingly, three nucleotide mutations were identified; all 
three mutations were non-synonymous i.e. resulted in an amino 
acid change (Table 2) . One mutation was located within the E2 
region (corresponding to aa position 615 of H77C) and generated 
a change from leucine to histidine. Another mutation was 
located within the serine-protease domain of NS3 (corresponding 
to aa position 1143 of H77C; arginine to histidine) . Both 
mutations are at positions that are universally conserved among 
all HCV genotype reference strains (Bukh et al., 1998). 
Finally, a third mutation was observed in the NS5B RNA- 
polymerase domain (corresponding to aa position 2875 of H77C; 
glutamic acid to aspartic acid) . Although this is not a 
universally conserved position, a substitution of aspartic acid 
has not been described previously. Thus, all three mutations 
were unique. 

Two additional mutations were identified upon 
rereading of the week 13 sequence: mutation at 514 (valine to 
methionine) and a synonymous mutation at amino acid 2631. 
Sequence analysis of the HCV-RNA recovered from chimpanzee 1590 
was also performed at week 24 and five nucleotide mutations were 
identified; four of which were non-synonymous. (Table 3) . 
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Table 3 

Evolution of HCV lacking HVR1 in chimpanzee 1 590*. We performed sequence analysis of the entire open reading frame of the virus recovered from 
chimpanzee 1590 at weeks 13 and 24 post-transfection. We confirmed that the HVR1 region was not present and identified differences from the wild-Type 
cDNA clone as indicated. 
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•Nucleotide <m) and amino acid (*a) positions of HT7C . Sequence of H77C(AHVRI) cDNA is shown on top. Dominant sequences recovered from the chimpanzee are shown in capital 
letters. Minor sequences recovered from the chimpanzee are shown in lower case tetters, ttogio titer determined by RT-nested PCR on 10-fold serially diluted extracted RNA. Jlog,. 
titer det ermined by second generation Monitor test (Roche). 
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Four of the mutations were identical to the mutations 
identified at week 13 and one of the week 13 mutations (the NS3 
mutation at amino acid 1143) reverted to wild-type. The single 
new mutation identified at week 24 was located within the serine 
protease domain of NS3 (corresponding to aa position 1456 of 
H77C; threonine to methionine) . 

To analyze a possible relationship between the 
increase in viral titer and the appearance of the four non- 
synonymous mutations identified at week 13, sequence analysis 
was performed from regions encompassing the described mutations. 
During weeks 4-7, direct sequence analysis did not reveal 
evidence of the mutated nucleotides at the four positions. 
Between weeks 8 and 14, coinciding with the first increase in 
viral titer, direct sequence analysis revealed the presence of 
amino acid substitutions at four positions (two in E2, one in 
NS3, and one in NS5B) (See Table 3). Therefore, there was a 
temporal association between the appearance of these mutations 
and an increase in viral titer (See Tables 2 and 3) . 

Example 5 

Transmission of HCV lacking HVR1 to a second chimpanzee 

To investigate whether the HCV mutant lacking HVR1 was 
transmissible, a naive chimpanzee (#96A008) was inoculated 
intravenously with 90 ml of plasma taken at week 4 from 
chimpanzee 1590, before appearance of consenus mutations in that 
chimpanzee (Table 3) . The virological and immunological 
correlates of HCV infection in chimpanzee 96A009 are shown in 
Fig. 4. Serum HCV-RNA was first detected at week 2 p.i., with a 
genome titer of 10 1 GE/ml and the titer increased to peak levels 
of 10 4 -10 5 GE/ml during weeks 3-9. The infection was resolved at 
week 18 p.i. Antibodies to HCV were not detected. A peripheral 
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mono-specific proliferative CD4 + T cell response against core 
(C22) was detected during weeks 4-13 p.i.; a multi-specific 
response against NS3 (C33-C) , NS3-NS4 (clOO) and NS5 was 
detected at week 18. The chimpanzee mounted an early and multi- 
specific peripheral CD8+ T cell response to a total of seven 
different epitopes (representing core, NS3, NS4 and NS5) at the 
weeks tested (weeks 4, 7, 11, 15 and 19). Finally, a multi- 
specific (C33-C, clOO and NS5) sustained proliferative CD4 + T 
cell response was detected in the liver from week 8 p.i. Liver 
enzyme values were marginally elevated during weeks 10-18 p.i. 
and necroinf lammatory changes were detected in liver biopsies 
during weeks 11-19 p.i. 

Sequence analysis of the entire open reading frame of HCV 
recovered from the chimpanzee 96A008 at weeks 4 and 9 p.i. 
showed that the transmitted virus lacked HVR1. Compared with 
the cDNA clone of that mutant, four nucleotide substitutions 
were identified in the virus and all four mutations were non- 
synonymous. (Table 4) . 
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Table 4 

Evolution of HCV lacking HVR1 in chimpanzee 96A008*. We performed sequence analysis of the entire open reading frame of the virus 
recovered from chimpanzee 96A008 at weeks 4 and 9 post-inoculation. We confirmed thai the HVR1 region was not present and identified 
differences from the wikMype cPNA clone as indicated. 
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•Nucleotide (nt) and amino acid (aa) positions of H77C. Sequence of H77C(AHVR1) cDNA is shown on top. Dominant sequences recovered 
from the chimpanzee are shown in capital letters. Minor sequences recovered from the chimpanzee are shown in lower case letters, tlogw titer 
determined by RT-nested PCR on 10-fold serially diluted extracted RNA. flogio titer determined by second generation Monitor test (Roche). 



Sequence analysis of regions encompassing the described 
mutations showed that a mutation in E2 and NS3 each had replaced 
the wild-type nucleotide by the time the virus was first 
detected (week 2) . An identical change in E2 had occurred in 
the transfected chimpanzee (Table 3) . A change in El appeared 
as the quasispeicies at week 2 but had replaced the wild-type 
nucleotide completely by week 3. The other mutation, in NS3, 
appeared at week 3 and persisted as a quasispecies during weeks 
3-10 and then reverted to wild-type. This latter mutation had 
been seen also in the transfected chimpanzee. (Table 3). 
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Example 6 

Analysis of potential conformational changes 
within E2 after L615H substitution 

Sequence analysis of HCV recovered from chimpanzee 
1590 demonstrated that a non-synonymous mutation occurred within 
the E2 protein during follow-up (nt 2186 of pCV-H77C; amino acid 
position 615) - Since a change from a non-polar amino acid 
(leucine) to a charged one (histidine) within E2 might change 
its conformation, Huh7 cells were transfected with E1E2-715 
(intact E1-E2), AHVR1-7 (the deletion of HVR1) or AHVRl-mut 
(deletion of HVR1 along with the mutation at position 615) , and 
the transfected cells were stained for E2 with a panel of 4 
rabbit hyperimmune sera and 12 human monoclonal antibodies 
(Cardoso et al. 1998; Inschauspe et al., 1998). 
Immunof luoresecence analysis showed that there was no 
significant change in the patterns of reactivity among the three 
different forms of E2 targeted to the cell surface (Table 5) . 
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Table 5 

Effect of deletion of the HVR1 region and mutation of 
E2 615 (leu-his) on the reactivity of cell surface-expressed E2 
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-RHS: rabbit hyperimmune sera. 

-Amino acid position within the H77C sequence of the peptides used to 
generate RHS are indicated in parenthesis. 

-The remaining 12 antibodies are human monoclonal antibodies (Cardoso et 
al- 1998; Inschauspe et al. t 1998). 

-Scoring system from - to ++++. - means no staining and ++++ means 
strong staining. 
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CLAIMS 

1. A nucleic acid molecule which encodes a human 
hepatitis C virus lacking the hypervariable region 1 of HCV 
envelope 2 protein, said molecule capable of expressing said 
virus when transfected into cells. 

2. The nucleic acid molecule of claim 1, wherein the 
molecule encodes at least one of the following amino-acids, a 
histidine at amino acid positions 615, 1143 and 1310, an 
aspartic acid at position 2875, a methionine at amino acid 
positions 514 and 1456, and an asparagine at amino acid position 
252. 

3. A nucleic acid molecule comprising a chimeric 
virus genome, said genome being a pestivirus or flavivirus 
genome in which the structural region of the pestivirus or 
flavivirus has been replaced by the structural region of a 
hepatitis C virus genome which lacks hypervariable region 1 of 
HCV envelope 2 protein. 

4. The nucleic acid molecule of claim 3, wherein at 
least one envelope gene from the structural region of the 
pestivirus or flavivirus genome has been replaced by the E2 gene 
from the structural region of a hepatitis C virus genome, said 
HCV E2 gene lacking hypervariable region one. 

5. The nucleic acid molecule of claim 3, wherein the 
structural region of the hepatitis C virus genome encodes at 
least one of the following amino-acids, a histidine at amino 
acid position 615, an asparagine at amino acid position 252, and 
a methionine at position 514. 

6. The nucleic acid molecule of claims 3-5, wherein 
the genome is a pestivirus genome. 
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7. The nucleic acid molecule of claims 3-5, wherein 
the pestivirus genome is a bovine viral diarrhea virus genome. 

8. The nucleic acid molecule of claims 3-5, wherein 
the genome is a flavivirus genome, 

9. The nucleic acid molecule of claim 8, wherein the 
flavivirus genome is the genome of a dengue virus, 

10. A DNA construct comprising a nucleic acid 
molecule according to claim 1. 

11. A DNA construct comprising a nucleic acid 
molecule according to claim 3. 

12. An RNA transcript of the DNA construct of claim 

10. 

13. An RNA transcript of the DNA construct of claim 

11. 

14. A host cell transformed or transfected with the 
DNA construct of claim 10. 

15. A host cell transformed or transfected with the 
DNA construct of claim 11. 

16. A host cell transformed or transfected with RNA 
transcript of claim 12. 

17. A host cell transformed or transfected with RNA 
transcript of claim 13. 

18. A hepatitis C virus envelope 2 polypeptide 
produced by the cell of claims 14, 15, 16 or 17. 

19. A hepatitis C virus produced by the cell of 
claims 14 or 16. 

20. A chimeric virus produced by the cell of claims 

15 or 17. 
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21. A hepatitis C virus whose genome comprises a 
nucleic acid molecule according to claim 1. 

22. A chimeric virus whose genome comprises a nucleic 
acid molecule according to claim 3, 

23. A host cell infected with the virus of claim 21. 

24. A host cell infected with the virus of claim 22. 

25. An envelope 2 polypeptide encoded by the nucleic 
acid sequence of claims 1 or 3. 

26. Antibody to the polypeptide of claim 25. 

27. Antibody to the hepatitis C virus of claim 21. 

28. Antibody to the chimeric virus of claim 22. 

29. A composition comprising a polypeptide of claim 
25 suspended in a suitable amount of a pharmaceutical^ 
acceptable diluent or excipient. 

30. A composition comprising a virus of claims 21 or 
22 suspended in a suitable amount of a pharmaceutical^ 
acceptable diluent or excipient. 

31. A composition comprising a nucleic acid molecule 
of claims 1 or 3 suspended in a suitable amount of a 
pharmaceutically acceptable diluent or excipient. 

32. A method of immunizing a mammal, said method 
comprising immunizing said mammal with the composition of 
claim 29. 

33. A method of immunizing a mammal, said method 
comprising immunizing said mammal with the composition of 
claim 30. 
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34. A method of immunizing a mammal, said method 
comprising immunizing said mammal with the composition of 
claim 31. 

35. Antibodies produced by the method of claim 32. 

36. Antibodies produced by the method of claim 33. 

37. Antibodies produced by the method of claim 34. 

38. T cells reactive with the virus of claims 21 or 
22, said T cells produced by the method of claim 33. 

39. T cells reactive with the virus of claims 21 or 
22, said T cells produced by the method of claim 34. 

40. A chimeric gene comprising, in 5 1 to 3' order: 

(i) an endoplasmic reticulum signal sequence; 
and 

(ii) a coding sequence which encodes a hepatitis 
C virus (HCV) envelope 2 protein lacking a) 
hypervariable region 1 of the full-length 

envelope 2 protein and b) at least the 30 

carboxy-terminal amino acids of the full-length 

envelope 2 protein. 

41. The chimeric gene of claim 40, wherein the 
envelope 2 protein coding sequence lacks about the carboxy 
terminal 85 amino acids of the full-length envelope 2 protein. 

42. The chimeric gene of claim 4 0, wherein the 
endoplasmic reticulum signal sequence of i) is amino acids 364- 
384 of the HCV El gene. 

43. The chimeric gene of claim 40, wherein the gene 
further comprises a plasma membrane anchor sequence 3' of the 
envelope 2 protein coding sequence of ii) . 
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44. The chimeric gene of claim 43, wherein an El 
coding sequence is located between the ER signal sequence of i) 
and the envelope 2 coding sequence of ii) . 

45. An expression vector comprising the chimeric gene 
of claim 40. 

46. An expression vector comprising the chimeric gene 
of claim 43. 

47. A host cell transformed or transfected with the 
expression vector of claim 45. 

48. A host cell transformed or transfected with the 
expression vector of claim 46. 

49. A method for expressing a secreted hepatitis C 
virus envelope 2 protein lacking hypervariable region, said 
method comprising transforming a host cell with the expression 
vector of claim 4 5 under conditions which permit expression of 
the secreted envelope 2 protein. 

50. A method for expressing a hepatitis C virus 
envelope 2 protein lacking HVR1 on the surface of a cell, said 
method comprising transforming a host cell with the expression 
vector of claim 4 6 under conditions which permit expression of 
the envelope 2 protein on the cell surface. 

51. An envelope 2 protein produced by the method of 

claim 49. 

52. An envelope 2 protein produced by the method of 

claim 50. 

53. A method for immunizing a mammal comprising 
administering the expression vector of claim 45 to the mammal in 
an amount effective to stimulate the production of protective 
antibodies to HCV. 
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54. A method for immunizing a mammal comprising 
administering the expression vector of claim 46 to the mammal in 
an amount effective to stimulate the production of protective 
antibodies to HCV. 

55. A method for immunizing a mammal comprising 
administering the envelope 2 protein of claim 51 to the mammal 
in an amount effective to stimulate the production of protective 
antibodies to HCV. 

56. Antibodies produced by the method of claims 53 , 

54 or 55. 

57. A pharmaceutical composition comprising the 
antibodies of claim 56. 

58. A pharmaceutical composition comprising the 
expression vector of claims 45 or 46. 

59. A pharmaceutical composition comprising the 
protein of claim 51. 

60. The nucleic acid molecule of claim 2, wherein the 
molecule encodes a methionine at amino acid position 514. 
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1234567890 1234567890 1234567890 1234567890 1234567890 

GOCftGOCOOC 1GA1GGHQ3C gOOTOCAC CMGftATCfiC H XU-'IUllA 50 

03AACEOG 1CTK303CA GAAAG03ICT AGGCAIQ333 TERGaffiTCRG 100 

■lUlUU l UC A G CCTZCAQGAC QGGBGBGOCft, TftGIQSECIG 150 

CQGAftCOQGfr GAATK30CAG GO3A03333 'lULTl ' lLTlU 200 

GAEAAA0335 CIO^IGCCT Q3ftGftTnQ3 GJjlULUJGC QGRAGftCIGC 250 

TAG333AGEA GIGTIQ33TC G03MAQ33T TIGIQCTACT G0Cn5ffiAQ3 300 

UlULTlULiaA GIG0CXXX33G A3S1U1UUJA GRODSrGCAC CaiGaQCAOS 350 

CK2AAGAAA AAQCAAA03T AACACCAAOC <3ID333CaCA. 400 

GGftOGICftAG TTO0O3GSK3 GQ33ICAGAT UUl'lUJlUjA. GlTlflLTlUr 450 

1Q00Q0QCAG GQQOOCIftGA. TIG33IGIQC Q3333A0GAG GAfiSOTX 500 

GAGGGGflGGC AAOJILXaftGS TAG&03ICAG GL'iMUULLA AG3CAD3I03 550 

G0Q33AI33X: AQ3AOHQ33 CICAQ33333 GiaOJLTlUS OJ LL VLLTA IG 600 

GCAfflGBGOG ITGQGULalUS GOGQSfflGGC ' IOJIUIL ' IU, ' 003IGGCTCT 650 

aauacT&ocr ooaaxccac A^anm; cuimjimj ocaAinQas 700 

TAAQ3ICATC C33K3033ZET CGCXJjKXZTC AIQXtJiACA. 750 

TOULUJlUbT 039333003* CHQGAQQGS CIGOCRGGGC O^lUUOJtJftT 800 

0303103333 TTCIQGftaGft. a330GIGftAC TaiGCAftCfiG GGAft ULTlUL ' 850 

■i mnuuiLT TiciCimtT ' iultil ' iuul ' cc kc ' il ' ilt toocigrcts 900 

TQ033QCTIC AQCXTUiOCAA. GTG332AATT CCTUGQGQCr ITAaCAIG'lC 950 

OJLXJEftACIC GACTATIGIG TA03AG3333 CD3MG0GAT 1000 

CClGCACaCT C03333TGIG TO3CTI003T T0333RQQ3T AAO30CTO3A 1050 

Q3IUIT333T G333SIGA0C O3CA033IQG 0C3££ftG33A 033CAAACIC 1100 

aX3£aftOQC AJJLT1LLIA03 TCfiSMCGAT C1ULT1U1ULJ Q3AGQ30CRC 1150 

(X'lci u c im Goociciaos romaoCT giojuu^ict uiumuriu . 1200 

TT03TCAACT UITUAUUIUU TC033ZAG32 G3ZACIQ3AC GAD3CAAGAC 1250 

TQCAKTIGIT CiML'lMOJ CQQDCKEflA A0333ICAIC 1300 

QGB3M13MG AIGAACIGGT 033ZTR03QC AQ0GTIQ3IG GT^QCICAQC 1350 

TXTOOOSAT O3CECAAG0C AiUALLUACA. TGAIOX'IGS TGC'ICftCIGG 1400 

GSfiGIOriGS 0333CAIAGC (JIMTIL'HJC AIQ31G333A ACIQ3333AA 1450 

G3TCCTQ3IA G'lUl'lUL'lUC TATTI03333 03I03A030G ATOCAACIGA. 1500 

TCftACAOCAA CD32AGTIG3 OOOCAATA. Q2ACG30CIT GAATIG3RAT 1550 

GAAAQCCTTA. ACA0333CIG GTI£QCAQ33 CICTICIKEC AACACAAATT 1600 

CAAL'ILTILA G30IGID2IG AGAQ3TIQ3C CAQCIG333A ( 1* x Ti'APO? 1650 

AlTnUUOJA. G33CIQ3QGT OdATCAGIT MG32AAQ33 AAQ0330CIC 1700 

GAOe^ACDCC OCmjiuuiU GCACEA03CT CCAAGADCTT GIQ3CATIGT 1750 

GDC032AAAG AGCGIGTOIG QQ3033IMA TIGCTICACT (XXZAQ00003 1800 
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FIG. IB 
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1GGIG3IG3G AMEAOCGaC AQGID33Q03 (EOCiaCEIA CM3CT3333T 1850 

ocAAAnxam osgaigiuit miujmac aacaceaqgc cftcaaciujj isoo 

CRfinGsnc Gsrroraocr qsegaacic aaciqgatic accaaagtot 1950 

UUUGAG03X COCTIGIGIC ATO33AQ33S *IQ33CAACAA CRt X TlU L 'lC 2000 

TOO30CACTG MTOCTKD3 CAAACAiaCG GAAG030T ACId033IG 2050 

ajUL'iaoasr cqciqsaiia caoxaqsig oegstogac uxxnsmm. 2100 

OJL'mUGCA CUftlULTlUr AOCATCAAIT ACAOGATATT CAAAGICAG3 2150 

A1UUAULS1UG GAGGGGjIOGA. GCAGAQ3CTC GAAGOaTCT QCAACU3GAC 2200 

GOGQGGCJGAA. 03CIGIGATC TOGAAGACAG QGACAOJIUJ GAQZTCAQCC 2250 

CUriUL'lUCT GIOGAOCACA CAGn33CAQ3 TCETTOCGIG TIL'ITICACG 2300 

AOOJJLULXJAG ULTib'lLLAC O330CICATC CAOl'lOCAOC AGAACATTCT 2350 

GGAmiUUAG TOCTIGiaaS GGXJEAQ3GTC AAflC m aGOj ' l U L 'lUJUJL A 2400 

TEAAGIG3GA GTAC33IC3GIT CDCTSTTOC T1L1ULTJ.UJ AGA0333GQC 2450 

UIL ' IUL ' IULT GLTIUIU^T GA UUl ' iK. ' lL ' AIA103ZAAG OQSfiG3CGQC 2500 

TIT33AGAAC CTOSmAIAC TCAA3QCAGC ATO3CIQGDZ QQ3AD3CZA03 2550 

GICTTOIGIC CTIUL'IULJIU TICTICIGCT T lUm i ULJl A 1CIGAAG3GT 2600 

AGGJIG33IGC OOGEAGOQGT CTAQ3D3CTC TA033GAH3T G G OC I C IU L T 2650 

UL ' IUL ' ILL ' IG CK3GCGTTGC CICAOCX33QC AIA033OG GACA033AGG 2700 

'1UUUUULU1C GK3U33333Z GflTCTICnG lUUJbTliA AT G3COCIUACT 2750 

CIGID3CEAT ATIACAAQ33 CTAIAICAQC TOGJIGGATOT G3H33LT1CA 2800 

GnamciG aqcagagiag AAocacAAcr qcacuiuiuu Grroacmr: 2850 

TCAAOGTID33 G33333333C GAIGOOGnCA 1UITACICAT CJlUlUi!ftb'i!A. 2900 

CACXXEAOCC T33CATTIGA CATCAQ2AAA CTACIGCIQ3 U Lm U l 'lL U S 2950 

ALXJULTl'lULJ A3TCITCAAG COOTIGCT TAAAGIDGCC TALTlULjlUL' 3000 

Q03TICAAQG (JLTIL'IULUG A3CIQOQCX3C TAQCG333AA GAUaGOCOGA 3050 

QGTCATER03 TOCAAATOQC CAICATCAAG TTAQ33333C TTACT3GCAC 3100 

CrAIGIGIAT AAOCAIC1CA. OOJL\LLTlUi AGACIGQQOG CACAAOQGDC 3150 

1G0GAGA3CT GQ003IGGCT GIG3AACCAG ' lUjlLTlL ' lU CCCAAIGGAG 3200 

AOCAAGCICA TCACG7B3333 GQCAGAIAOC QOCX3O3IQ0G GIGACATCAT 3250 

CAACDQCTIG O033IUIUIG 0333TAG333 OCAQGAGAIA CIULTIUJLL ' 3300 

CAG0CEAO3S AA3G3ICTOC AAQQ03IG3A. QGTIGCIG3C Q3XA1CA0G 3350 

G0GrEAO3CXX: AQCAGACEAG AGGCJCIOl'iA QG3IGEA3RA TCACCAGOCT 3400 

GACIG3XGG GACAAAAAOC AAGIQGAQ33 TCAGG'IQJAG ATO3IGICAA 3450 

L'lGCTAOOCA AADCTIDCIG GCAACGIGCA. TCAA3GQGCT A3GCTGGACT 3500 

GICTBOCACE GQQ003GAAC GAG3AOCATC GGATCAD32A AG3GICCIGT 3550 

CA3GCAGAIG TA1A02AATG TQG&DCAAGA OLT1U1U33C TGCCCCGOTC 3600 
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CICAAQ3TIC CCQ2TCATIG ACAOX'IUIA CCIQ333CIC CK333RCCIT 3650 

•mOCIGSICA G3AQ32A03C OSJOGICftlT OOGHIUJUOC Q303RQ3IGA. 3700 

TM3ZM3333T N3COTGCTJT OXOXmsi OflTICaaC TTCAAAQ3CT 3750 

0CTO3Q3933 TCCQCIGTTG TO003333QG SOQ3333T Q33XTATIC 3800 

AQ33303093 TCIQCACOCE 1Q3BGfIQ3CT AAAQQ33IG3 A L ' lTimULL ' 3850 

T3IQ3AGAAC CTBG3SO& CCaiGAGATC Cm33IGTTC J033ACAACT 3900 

CUIL'ILOCC AGCAGTOJUL" CAGAULTIUC AG3IQ33GCA OJlULmULT 3950 

ODCAOCQ3CA. GCQ3I&AGAG CAOGAAGGTC 033XTO33T A03ZAQ3GCA 4000 

G33CTACAAG GIGITOGflQC TCAA03X1U TGTIQCIQCA AQ3CIG33CT 4050 

TIGGflGCTIA CaiGHXAAG 003300333 TlGflllLTftA. miCAQ3A03 4100 

G333TGAGAA CAATiaCCAC 1O33AO30X AICAOGfEO 1 O3A0CERCX33 4150 

CAAbTlULTl' 0333A03333 OGJIUL'lUAQG MXHOCTEKT 4200 

TITSIGaOSA UlOjCftCTCC AOSSMGOZft. Q ^iULMUlT Q3GagCI3JC 4250 

ACT3ID3TIG GAL'IOJUUUG Q0GRGACIO3 TIUIOJI U UL ' 4300 

ooultaoc ocioosqqct ocdicacigt Giaxarocr aackio3aq3 4350 

AJUriUL'ltT GTOCACCAOC GGAGAGKTOC CCTrm033 CAAQ3ZEAIC 4400 

03D3ID3M3 T3MCAAQ33 QQSAflGaCftT CILMLTILT Q0C3OCAAA 4450 

GAAGAAGIGC GA03RGCIO3 CCG03AAOZT G3 10 JU A T1U GQCftTCAAJG 4500 

O3GIO30C1A CE70D3333T CITCA03IOT C l UlimULJL' GAOCAQ3332 4550 

GAIGTIGI03 TO3IGT03AC 03M0CICTC AIGAL'IOJLT TEftC03303A 4600 

CnUsACICT GIGATAGACT GO&CftOGTG 'lOlUACICaG AOOOSATT 4650 

TCRGOCITGA. CCCmJjm AOCKTIGASA CAACEA03CT ODOOCftGGKr 4700 

OC'IUICICCA. G3ACIC&AD3 C03333ZAQ3 ACIG32AQ33 QGAAG3CAQ3 4750 

CKTCmmSi. TTIGI03CAC CG3333AQ0G C002TCC333C AIGTTOGACT 4800 

03T0D3I0CT CT3IGAGIOC TKIGAOSCQG OJ1U1ULT1U GIMGftQCIC 4850 

AO30D33333 AGOSOGT T&Q3CH&03A G33UOOGA. ACAC033333 4900 

GCTKXDSIG TG3CAQ3AQC AILTIUAATT TIO33AG0QC c-!Jt 'I'l'i' A O?? 4950 

G3CK30CA tmwsigoc cacrrrmr ggagagioss 5000 

GAGAACTTIC CmOCTOGT AQQ3IBOZA& GOGA033IGT Q03CEAQ33C 5050 

TCAftOOCXET LCU3CATD3T OQGROCAGRT GIQ3AAGIGT T TV-avu IQGOC 5100 

ITAAA032AC 0CIO3AIG3S 0CAAC7O3X TOCTKDOiG ACCOOOOGCT 5150 

GirCAGAATC AAGTCAOXT GAQ3ZAOGCA ATCAOCAAAT ACA1CA11UAC 5200 

AIQCAIGIOS OD33AOCT33 AQGJ1UJ1UAC GAQCAQCJD33 GlQJlULjl ' lU 5250 

Q03Q03IOrr 03CIOCICTO OD3303IMT GX'iUlCAAC AGQCIGOSTG 5300 

GiamOQG 0CSO3AIO3T CTIGTOOOGS AAQ03332AA TTKEAOC7IGA. 5350 

CAQ3GAGGIT CTCTADZAQ3 AGTI03A1GA GAI03AAGAG TOCTCICfiGC 5400 
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ACTEAOOGIA CA103AGCAA Q3GA3GA2Q0 T0QCIGM30A GITCAAQCW3 5450 

aaoauuLTO gociocigca gaoosqgioc cgocatgcag aggtiatcac 5500 

(JUL'lUL'i m C CAGAOCAACT G3CAGAAACT CGAQ3ICTIT T33903AAGC 5550 

ACAIGTOGAA TriCAlCAGT G33AIACAAT ACTIQ30QQG COU1UAACG 5600 

CiQOCTQSm ffl3033OT T3LT1CAT1U AiUULTlTlA OGCIQOaST 5650 

CAOZAGOOCA CTAAQIACIG GaCAAAOOOT (JU1LT1CA AC AER TTOGGGG 5700 

Q3IQGGIQQC TGOOGAGCIC 0000000303 UIUUIUCEAC 'lUULTl'lUlU 5750 

OJIUL'IUOX TAGCIG303C 030000300 AQ33TIGGAC TOQSGAAGGT 5800 

CCT33TOGAC AT1LT1UCAG QGIKIOOOQC GQ30GIO3O3 GGAGL'ILTIU 5850 

■EAGCATICAA GA2CATGAQC G3IGAQ31O0 OCIOCAOSGA GGAOCIQGTC 5900 

AAIUIOOIGC 033OZAI0CT CIOQ00IQ3A GJULT1UUAG T033IGIQ3T 5950 

CIG03CSQCA KEMZIGCQCC Q3CA03TIG3 O0CGQG0GAG Q333CAGTQC 6000 

AAIQGA3GAA 0333CIAA1A QULT1UJULT O033333GAA O0AIGTTTOC 6050 

aocAOOcacr aogigoosga gaqogaigca oaaooaosos tcacigocat 6100 

ACICAGCAGC CICACIGEAA OOCAQCIOCT GAGGOGACIG CA1CAGIGGA 6150 

TAAQCTCGGA GIGTAGCACr OCMQCIOOS GTIOOIUGLT AAG3GACATC 6200 

TQ3GACIOGA TA3Q33AG3T GCIGAQOGAC TrmAGAOCT G3CTGAAAQC 6250 

CAAGCICATC OGACAACTGC CIQGGATICC L'lTlUlUlUO TO33AGQ303 6300 

G3IA3AG333 G3TCIQ30GA GGAGACG3CA TIBIGCACAC lUJL'lUOJAC 6350 

T3IGGAGCIG AGATCACIG3 ACATCICAAA AA0333A03A IGftQGMOQT 6400 

QGGIUL'IAGG ALX-'IOCAQSA ACA3GTQGAG TO3GAQ3TTC OOCATTAAOG 6450 

OCEACAOCAC G3G000CIGT ALVLUJULTIU CTO0300GAA CIA3AAGTIC 6500 

GOQCTGTGGA G33IGICIG0 AGAQGAATAC GIGGAGAIAA GQ030GIQ3G 6550 

GGAL TI U CA C TBOGEAIOGG GOATCACEAC T3ACAATCTT AAASQ003GT 6600 

G0CAGA3OX ATO33CD3AA TITITCACAG AATIGGA033 OUlUaJOL'lA 6650 

CACAQ3TTIG aJLLLLLTlG CAAGOOCTIG CIGOGGGAGG AG3TATCATT 6700 

CAGAGTAGGA CIOCAOSAGr AO00QGTGGG GTOGCAATEA OOTIG03AQC 6750 

O0GAAO03GA aSIBGOOSIG TIGA03I0CA T30ICACTGA 'lUUL'lULXJAT 6800 

A2AACAGCAG AQ3CGGC03G GAGAAGGTIG G0GAGAG3GT CAOO00CTIC 6850 

TOIGGO0AG0 lOOIOGGCTA GOCAGCIUIU OX'lOCAICr CTCAAG3CAA 6900 

CTIGCAOOQC CAAOGAIGAC T000CTGAOG OOSAGCICAT AGAGGCTAAC 6950 

CIOL ' IUIUGA G3CAGGAGAT G3Q0GGCAAC ATCAOCAGQG TIGAGTCAGA 7000 

GAACAAAGIG GTGAITCIQ3 ACIOCTIOGA T003CTIGIG GCAGAQGAGG 7050 

A1UAG033GA G3ICIO03m GCIGCAGAAA TIOIQ03GAA GICI033AGA 7100 

'l'lUGGaa333 OOCIGOOOGT CTOQ30Q03G COOGACEACA AOOOOQOSCT 7150 

AGTAGAGAGG TG3AAAAAQC CIGACTAOGA AOCAOCTCIG GIOOAIQ3CT 7200 
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GCC03JmJ2 ACCICCAGQG TODCCICC'IG H3CC1CXI30C TO33AAAAAG 7250 

OGTffiOGGflQG TOZTCAQCGA. AICAAOOCm TCEACIQCET T330CGAQCT 7300 

TQCCACCAAA ALMTl'lUQCA. GC'ICCICAAC TRXQ32ATT AOG3GCGACA 7350 

AaaOGACAAC AIOC'ICIGAG CCOGO00CTT Cl U JL'lUJ UL ' OCOOSOCC 7400 

GfiuuriLJAGfT ccraricric caigcooccc eiGGRoaaas agcciqggga 7450 

TOD33ATCIC AGCGACG33T CAIGGTCJGAC QStCAGIRGT GQGGOOGACA 7500 

03GMSOGT OSIGIQCIGC TCAATOTCTT ATIUL'IUGAC AQ30GCACIC 7550 

GICaODOCCT GO3CIG03GA AGAACAAAAA CIQ03CATCA. AC3GACTGAG 7600 

CAAL'lUoTIG CEftOQOCATC ACAATCIOST GEKnCEAOC ACTTCAOQCA 7650 

Gi.ULTJ.UULA AAGGCAGAAG AAAGTCACAT TIGACAGACT QZAAGTICTG 7700 

GAGAQ0CA2T ACCAGGACGT GCICAAQGAG GICAAAQCAG CGG03IGAAA 7750 

AG1GAAGGCT AALT1ULTAT GOGIMaRQGA A3ZTIGCAGC CIGACGCCCC 7800 

C20TICRQC CAAATCCAAG TTIG3CTOTC G93CAAAAGA (JUlUUbTlUC 7850 

aaaacaGAA Aooaasraac cxsoicaac TCa a iGiu A aagaccitct 7900 

Q3AAGACAGT GU^ACACCAA. TaGACACI3\C CAICA2G3X AAGAACGAQG 7950 

TrnCIGOGT TCAGOCIGAG AAGQQ3QGIC GTAAGCCAGC laaiClU flE 8000 

GIGTiaXX33 ADCIGQ303T Q0Q03IGIGC GAGAAGATOG GCCTGTACGA 8050 

Ub'lUb'lTAQC AAGCIO0D0C TGQ003IGAT GGGAAGCTGC TACOGATia: 8100 

AAIACTCAGC AGGACAGaQG GTTGAA3TOC TOGTGCAAQC GIGGAM3IDC 8150 

AAGAAGAGOC CGA3GGGGTT CIOGTAIGAT A3XJGCIGTT TKSOOCAC 8200 

AGTCACIGAG AGGGACATCC GmGGGAGGA. Q3CAATTEAC CAAIGTIGTG 8250 

ADCIQSOX CXZAAGOCD3C G1GGGCATCA AGTOEICRC TCAGAGGCTT 8300 

TATGTTGGGG GOLXJ1LTUAC CAAITCAAGG Q3QGAAAACT Q033CTAOGG 8350 

CRQGIGOOaC QCEfiOaQQaS TJAGTGACAAC ■raOCIGIQGfr AAGACOCTCA 8400 

CTIOZCACAT CAAQ30CD33 GCAGCCIGTC GAQ033ZAQG GCTCCAaGAC 8450 

TOCAOCATQC TO3IGIGTQG CGAOGAGTIA GICSTmiCI GIGAAAGIGC 8500 

G3Q3GTTCCAG GAGGAGGGGG CGAGOCIGAG AQULT1UAOG GAGGCTA3GA 8550 

a^Gsnoc 03dxxmx GQ3Gaoaoac cacaagcaga atacgactig 8600 

GfiGCmiRA. CAICAIGCTC CICCAAGGTG TCAGIOXOC AGGACGGCGC 8650 

TGGAAAGAGG GICEACiaaC TIXXTGIGA. QCCTACAACC CTOGIU3CGA 8700 

GAG00003IG GGAGACAGCA AGACACACTC C^GICAATIC CTGGCIBGGC 8750 

AACATAATCA TGTTIGOQOC CACACTGTGG GOGAGGAJGA. IBCTCKIGAC 8800 

OJALLT1LT1T AQCJGIUL'ILA TOGCCAGGGA. TCAQCTTGAA CA GUL'ILTIA 8850 

ACTGTGAGAT CEAOGGAGCC TGCDOCCA. TAGAACCACT GGATCIACCT 8900 

OCAATCATIC AAAGACIGCA TO3CCICAQC GCAT.LTJ.CAC TDGACAGTIA. 8950 

CICIDCAGGT GAAA1CAA3A GGGIGGOOQC AIGCCTCAGA AAACTIGGGG 9000 
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TOaOOOOCrr OOGftGCriGS AGftCfiQ03QG GCCQGAGOGT CO303CTAQS 9050 

CTIL ' IUIU C A GAQ3RQ3CAG G3CIQ0CAIA. 1GIQ3GAAGT ACCKTICAA 9100 

CIG3C3C2Cm AGAACAAAQC TCAAACICAC TOCAAIAOCG Q333ZD33X 9150 

Q3C7IG3ACTr G?1ULUL,T1UG TICA033CIG GCIACAG033 G33AGACAIT 9200 

-iBTCAcaGas igicickiqc anmnx: rosncrasr TriujL'UAcr 9250 

eriGC'lOCT GCAICmCET OJIUJJJAAC OGKIGAAQGrT 9300 

TO333IAAAC JOC093CCT CITAAQOCAT TIULVLUITIT TmTl ' mT 9350 

TrmmTi' TmcrnrT TrnncnT ocrnocnc iotttoct 9400 

TlLTl ' l'l ' lUC CnCITEAftT GJiULL'lULA '1LTJJAUUJLT AGICMXSXT 9450 

AQCIGIGAAA G3KX33IGAG C03CMGACT QCAGAGAGTC CTCKmCIOS 9500 

CCICIUIGCA 9518 
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MSDJEKFCPt TKHNINRRPQ EVKFFQQQQI W3GJYLLPRR GERDGVRA3R 50 

KISER92ERG KRQPIEKARR PEEKIW&QPG 03TOOiLI£P 100 

RGSRPSWGET IX'KRRSRNLG KVIOmTCCF AEOEYIPLV <3\ED3GAARA 150 

LAB^MKVLED GVNYMO^LP QCSFSIFLLA LUgCLTVEAS AYQtfKNSSGL 200 

YH7TNDCENS SIVYEAADAI 1HTPGOJPCJ KEENASRCW AVIFIVAIRD 250 

GKLPIT2LER HIDIXVGSAT LC5ALYVGDL OGSVECVG3L FUSERRHOT 300 

TQDCNCSIYP OiTIGHRMfiW IlflMMSFIA ALWAQLLRI FQAIMEMIAG 350 

AHWGVLAGIA YESNGNKftK VDJVLLLFPG VDAIQLININ GSKHENSE^L 400 

NCNESEIsTIGW NSSQCPERLA SCHRLTDFAQ GWGPISYSN3 450 

9GIXEKPXCW HKEPRP03IV PAKSV03FVY CPTPSPVWG TITR9GAFIY 500 

St^NDIDVF VLNNIRPPLG IWGCIWMNS TGPTKVDGAP PCVIQGWGPN 550 

TLICFIDCER KHFEATYSRC G9GEWTTETC MVDXFXRLJWH YPCTINXTIF 600 

KVFMYVQGVE HRLEAACNWT PGEEdXEER DRSELSPLLiL SITQWQtfLEC 650 

SFTCLPALST GLEHIH3tEV DWJXLXGVGS SIASrtAIKWE vwri. FrT.TA 700 

EMMSCUWM MLU9QRERA LENLVILNAA SLAGIH3LVS 750 

IJ<GRWVPGAV mXQMELL T1T1T.TALPQRA 033WLVC2M 800 

ALTLSFYYKR YIS&OWWLQ YFUIKVH^QL HVWVEELNVR GGBIAVIIIM 850 

CWHPILVFD ITKLLLAEFG ELMRQASLL KVPYFVRVQS LLKECALARK 900 

IAQGHYVQ^ HKD3UJIGT YVYNHLJTPLR DWAHNC31KDL AVAVEEWFS 950 

RMEIIKLriWG ADIAA03DII N3LFVSARRG QEHLLGGBDS MVSKHrKLEA 1000 

pnRYAQQIR Gm3CmSL TGREKN3VB3 EVQIVSIKIQ TF1AKI0SIG7 1050 

CWIVYB3AGT RTIASEK3V IQM5OT3VDQD LVGWPAPQGS ESLTECTOGS 1100 

SEOLVIKHA EV1WKHHGD SRGSLLSPRP ISYLKGS9QG ELLTPAGHAV 1150 

GLFRAAVCIR GUAKAVEFIP VEND3ITMRS PVPHUSSPP AVPQSPQ^H 1200 

LHAPIG9C3CS 1KVPAAYAAQ CTKVLVINES VAATLSFGKy MSKAH3WDEN 1250 

IKlbVKiTlT GSPnYSITC KFLADGGC9S O^DIIICDE CHSIDKTSIL 1300 

GIGIVUDQAE TAGWRLVVLA TATPPGSVIV SHENTERUflL STIGEIEFXG 1350 

KAIPLEVIKG (SHLIPCHSK KKTTM AAKL VMJSENAVAY YRGLEWSVIP 1400 

TSGDJVWST EftlMIGFIGD FDSVIDCNIC VIQIVDFSID PltTlRLTlL 1450 

PQDAVSRTQR PGRT3«2CPG IYRFVAPGEER P904FDSSVL CECXDAGCAW 1500 

YELTEAETIV RLRAYMNTPG LPVOQEHLEF WEBVFIGCIH IDKHFLSQflK 1550 

Q9GENFFXLV AtfQAJVCARA QAPPPSWDQM WKTT.TRLKPT IH3PIPLLYR 1600 

LGAVgSEVIL THPTIKYZMT CMSADLEWT SIWVLVQ3VL AALAAYCLSJT 1650 

QCWIVGRIV LSGKPAIIED REWLYQEEDE MEECSSiLPiT IE33MLAEQ 1700 

FKQKAL3LLQ 1ASRHAEVTT PAV0INWQKL EVFWAKHMrtN FIS3IQYIAG 1750 

LSTLPC3SEAI ASIMAFIAAV TSELTIQQfIL LFimX3QWWA AQLAAFGAKT 1800 

AFVGAGLAGA AIGSVGD3KV LVDIIAGXGA OZAGALVAEK IM9GEVPSTE 1850 

I^miiEAIL SPC^LWGW CAAHtfRHVG PGE3AVQSAMN RLIAEASRGN 1900 
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HVSPIHWPE SDAAAKVIAI LSSLTVIQLL RRUQWISSE CTTPC9G9WL 1950 

RDBOttCEV LSEEKTWUKA KL24PQLPGTP FVSC3QRGXFG VWRODGIMHr 2000 

RCHDGAETIG HVKNGIMRIV GERTCEWWS GIFFWKCTT GECIELPAHST 2050 

YKERLWRVSA EEYVK1KKVG DFHWSGMIT ENLiOOQIP SEEFPIEUDG 2100 

VRLHREAPPC KELLEEEVSF RVGUIEXEM3 SQLPCEPEED VAVLTSMLTD 2150 

PSHTIAEAAG RRLARGSFPS MASSSA9QLS APSU<AICEA NHDSPDAET.T 2200 

GC3STIRVESE NKWUDSFD FLVAEEDERE VSVEAETLEK 2250 

SRREARALEV V2AREDXNPEL VEIWKKEDXE PPWH3CELP EERSFEVFPP 2300 

KKKKIWLTE STLSIMAEL AIKSPGSSST SLLL1GLNITT SSEEAP9QCP 2350 

PDSDVESYSS MPELEEEPGD EDLSDGSWST VS9GRDIEW VOCSMSYEWT 2400 

GALVITCAAE B3KLPINALS NSIZ^HHNLV YSTISRSAOQ FQKKOTH3RL 2450 

QJLDSHXJW IKEVKAAASK VKANLLSVEE ACSLTFFHSA KSKP3YGAKD 2500 

VIOIARKAVA HIMSVWKE3LL EDSVIPUJIT EtiAKNEVTCV QEEKQGKKPA 2550 

mZWFDLGJ HVCEKMALYD WSKLPLAVM GSSYGFQYSP GQEWEFLVQA 2600 

WKSKKTEM3F S YDIRCFD ST VTESDIRIEE AIYgmnP QARWAIKSCT 2650 

EE^LYVQGELT NSRGEN33XR RCRA9GyLTT SOGNILTCYI KARAACRAAG 2700 

D3DCIMLVOG WDJVICESA GVQEEAASUR AFIEAMIKYS APPGDPPQPE 2750 

YDLELTISCS SNVSVAHDGA. GKKVYYLTKD FITELARAAW EIMRHTEVNS 2800 

VauajriMEAP TLWAIMIIMr HETSVLIARD QLEQALNCEL YGACXSIEEL 2850 

IX.PPIIQRLH GLSAFSLHSY SPGEINRWAA CLSKLGVFEL RMflRHRARSV 2900 

RARlXSRGai AAIOSQfLFN WAVKIKLKLT PIAAAGKLDL 33^FI2GYSG 2950 

GDIYHSVSHA REf^WFWCXL LLAAGVGIYL LENR 2984 
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SEQUENCE LISTING 

<110> FORNS, XAVIER 
BUKH, JENS 
EMERSON , SUZANNE U. 
PURCELL, ROBERT H. 

<120> NUCLEIC ACID MOLECULES ENCODING HEPATITIS C VIRUS WHICH 
LACK HYPERVARIABLE REGION ONE OF E2 AND USES THEREOF 

<130> 2026-431PC 

<140> To Be Assigned 
<141> 2000-09-22 

<150> 60/155,823 
<151> 1999-09-23 

<160> 12 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 9518 
<212> DNA 

<213> Hepatitis C virus 
<400> 1 

gccagccccc tgatgggggc gacactccac catgaatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gataaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcaagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaacgt aacaccaacc gtcgcccaca ggacgtcaag ttcccgggtg 420 
gcggtcagat cgttggtgga gtttacttgt tgccgcgcag gggccctaga ttgggtgtgc 4 80 
gcgcgacgag gaagacttcc gagcggtcgc aacctcgagg tagacgtcag cctatcccca 54 0 
aggcacgtcg gcccgagggc aggacctggg ctcagcccgg gtacccttgg cccctctatg 600 
gcaatgaggg ttgcgggtgg gcgggatggc tcctgtctcc ccgtggctct cggcctagct 660 
ggggccccac agacccccgg cgtaggtcgc gcaatttggg taaggtcatc gataccctta 720 
cgtgcggctt cgccgacctc atggggtaca taccgctcgt cggcgcccct cttggaggcg 780 
ctgccagggc cctggcgcat ggcgtccggg ttctggaaga cggcgtgaac tatgcaacag 84 0 
ggaaccttcc tggttgctct ttctctatct tccttctggc cctgctctct tgcctgactg 900 
tgcccgcttc agcctaccaa gtgcgcaatt cctcggggct ttaccatgtc accaatgatt 960 
gccctaactc gagtattgtg tacgaggcgg ccgatgccat cctgcacact ccggggtgtg 1020 
tcccttgcgt tcgcgagggt aacgcctcga ggtgttgggt ggcggtgacc cccacggtgg 1080 
ccaccaggga cggcaaactc cccacaacgc agcttcgacg tcatatcgat ctgcttgtcg 1140 
ggagcgccac cctctgctcg gccctctacg tgggggacct gtgcgggtct gtctttcttg 1200 
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ttggtcaact gtttaccttc tctcccaggc 
ctatctatcc cggccatata acgggtcatc 
cccctacggc agcgttggtg gtagctcagc 
tgatcgctgg tgctcactgg ggagtcctgg 
actgggcgaa ggtcctggta gtgctgctgc 
tcaacaccaa cggcagttgg cacatcaata 
acaccggctg gttagcaggg ctcttctatc 
agaggttggc cagctgccga cgccttaccg 
atgccaacgg aagcggcctc gacgaacgcc 
gtggcattgt gcccgcaaag agcgtgtgtg 
tggtggtggg aacgaccgac aggtcgggcg 
cggatgtctt cgtccttaac aacaccaggc 
ggatgaactc aactggattc accaaagtgt 
tgggcaacaa caccttgctc tgccccactg 
actctcggtg cggctccggt ccctggatta 
ggctttggca ctatccttgt accatcaatt 
gaggggtcga gcacaggctg gaagcggcct 
tggaagacag ggacaggtcc gagctcagcc 
tccttccgtg ttctttcacg accctgccag 
agaacattgt ggacgtgcag tacttgtacg 
ttaagtggga gtacgtcgtt ctcctgttcc 
gcttgtggat gatgttactc atatcccaag 
tcaatgcagc atccctggcc gggacgcacg 
ttgcgtggta tctgaagggt aggtgggtgc 
ggcctctcct cctgctcctg ctggcgttgc 
tggccgcgtc gtgtggcggc gttgttcttg 
attacaagcg ctatatcagc tggtgcatgt 
aagcgcaact gcacgtgtgg gttccccccc 
tcttactcat gtgtgtagta cacccgaccc 
ccatcttcgg acccctttgg attcttcaag 
gcgttcaagg ccttctccgg atctgcgcgc 
tgcaaatggc catcatcaag ttaggggcgc 
cccctcttcg agactgggcg cacaacggcc 
tcgtcttctc ccgaatggag accaagctca 
gtgacatcat caacggcttg cccgtctctg 
cagccgacgg aatggtctcc aaggggtgga 
agcagacgag aggcctccta gggtgtataa 
aagtggaggg tgaggtccag atcgtgtcaa 
tcaatggggt atgctggact gtctaccacg 
agggtcctgt catccagatg tataccaatg 
ctcaaggttc ccgctcattg acaccctgta 
cgaggcacgc cgatgtcatt cccgtgcgcc 
cgccccggcc catttcctac ttgaaaggct 
gacacgccgt gggcctattc agggccgcgg 
actttatccc tgtggagaac ctagggacaa 
cctctccacc agcagtgccc cagagcttcc 
gcggtaagag caccaaggtc ccggctgcgt 
tcaacccctc tgttgctgca acgctgggct 



gccactggac gacgcaagac tgcaattgtt 1260 
gcatggcatg ggatatgatg atgaactggt 1320 
tgctccggat cccacaagcc atcatggaca 1380 
cgggcatagc gtatttctcc atggtgggga 144 0 
tatttgccgg cgtcgacgcg atccaactga 1500 
gcacggcctt gaattgcaat gaaagcctta 1560 
aacacaaatt caactcttca ggctgtcctg 1620 
attttgccca gggctggggt cctatcagtt 1680 
cctactgctg gcactaccct ccaagacctt 174 0 
gcccggcata ttgcttcact cccagccccg 1800 
cgcctaccta cagctggggt gcaaatgata 1860 
caccgctggg caattggttc ggttgtacct 1920 
gcggagcgcc cccttgtgtc atcggagggg 1980 
attgcttccg caaacatccg gaagccacat 2040 
cacccaggtg catggtcgac tacccgtata 2100 
acaccatatt caaagtcagg atgtacgtgg 2160 
gcaactggac gcggggcgaa cgctgtgatc 2220 
cgttgctgct gtccaccaca cagtggcagg 2280 
ccttgtccac cggcctcatc cacctccacc 2340 
gggtagggtc aagcatcgcg tcctgggcca 2400 
ttctgcttgc agacgcgcgc gtctgctcct 2460 
cggaggcggc tttggagaac ctcgtaatac 2520 
gtcttgtgtc cttcctcgtg ttcttctgct 2580 
ccggagcggt ctacgccctc tacgggatgt 2640 
ctcagcgggc atacgcactg gacacggagg 2700 
tcgggttaat ggcgctgact ctgtcgccat 2760 
ggtggcttca gtattttctg accagagtag 2820 
tcaacgtccg gggggggcgc gatgccgtca 2880 
tggtatttga catcaccaaa ctactcctgg 2940 
ccagtttgct taaagtcccc tacttcgtgc 3000 
tagcgcggaa gatagccgga ggtcattacg 3060 
ttactggcac ctatgtgtat aaccatctca 3120 
tgcgagatct ggccgtggct gtggaaccag 3180 
tcacgtgggg ggcagatacc gccgcgtgcg 3240 
cccgtagggg ccaggagata ctgcttgggc 33 00 
ggttgctggc gcccatcacg gcgtacgccc 3360 
tcaccagcct gactggccgg gacaaaaacc 3420 
ctgctaccca aaccttcctg gcaacgtgca 3480 
gggccggaac gaggaccatc gcatcaccca 3540 
tggaccaaga ccttgtgggc tggcccgctc 3600 
cctgcggctc ctcggacctt tacctggtca 3660 
ggcgaggtga tagcaggggt agcctgcttt 3720 
cctcgggggg tccgctgttg tgccccgcgg 3780 
tgtgcacccg tggagtggct aaagcggtgg 3840 
ccatgagatc cccggtgttc acggacaact 3900 
aggtggccca cctgcatgct cccaccggca 3960 
acgcagccca gggctacaag gtgttggtgc 4020 
ttggtgctta catgtccaag gcccatgggg 4080 
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ttgatcctaa tatcaggacc ggggtgagaa 
ccacctacgg caagttcctt gccgacggcg 
tttgtgacga gtgccactcc acggatgcca 
accaagcaga gactgcgggg gcgagactgg 
ccgtcactgt gtcccatcct aacatcgagg 
ccttttacgg caaggctatc cccctcgagg 
gccactcaaa gaagaagtgc gacgagctcg 
ccgtggccta ctaccgcggt cttgacgtgt 
tcgtgtcgac cgatgctctc atgactggct 
gcaacacgtg tgtcactcag acagtcgatt 
caaccacgct cccccaggat gctgtctcca 
ggaagccagg catctataga tttgtggcac 
cgtccgtcct ctgtgagtgc tatgacgcgg 
agactacagt taggctacga gcgtacatga 
atcttgaatt ttgggagggc gtctttacgg 
cccagacaaa gcagagtggg gagaactttc 
gcgctagggc tcaagcccct cccccatcgt 
ttaaacccac cctccatggg ccaacacccc 
aagtcaccct gacgcaccca atcaccaaat 
aggtcgtcac gagcacctgg gtgctcgttg 
gcctgtcaac aggctgcgtg gtcatagtgg 
ttatacctga cagggaggtt ctctaccagg 
acttaccgta catcgagcaa gggatgatgc 
gcctcctgca gaccgcgtcc cgccatgcag 
ggcagaaact cgaggtcttt tgggcgaagc 
acttggcggg cctgtcaacg ctgcctggta 
cagctgccgt caccagccca ctaaccactg 
ggtgggtggc tgcccagctc gccgcccccg 
tagctggcgc cgccatcggc agcgttggac 
ggtatggcgc gggcgtggcg ggagctcttg 
cctccacgga ggacctggtc aatctgctgc 
tcggtgtggt ctgcgcagca atactgcgcc 
aatggatgaa ccggctaata gccttcgcct 
acgtgccgga gagcgatgca gccgcccgcg 
cccagctcct gaggcgactg catcagtgga 
gttcctggct aagggacatc tgggactgga 
ggctgaaagc caagctcatg ccacaactgc 
ggtatagggg ggtctggcga ggagacggca 
agatcactgg acatgtcaaa aacgggacga 
acatgtggag tgggacgttc cccattaacg 
ctgcgccgaa ctataagttc gcgctgtgga 
ggcgggtggg ggacttccac tacgtatcgg 
gccagatccc atcgcccgaa tttttcacag 
cgcccccttg caagcccttg ctgcgggagg 
acccggtggg gtcgcaatta ccttgcgagc 
tgctcactga tccctcccat ataacagcag 
cacccccttc tatggccagc tcctcggcta 
cttgcaccgc caaccatgac tcccctgacg 



caattaccac tggcagcccc atcacgtact 4140 
ggtgctcagg aggtgcttat gacataataa 4200 
catccatctt gggcatcggc actgtccttg 4260 
ttgtgctcgc cactgctacc cctccgggct 4320 
aggttgctct gtccaccacc ggagagatcc 4380 
tgatcaaggg gggaagacat ctcatcttct 4440 
ccgcgaagct ggtcgcattg ggcatcaatg 4500 
ctgtcatccc gaccagcggc gatgttgtcg 4560 
ttaccggcga cttcgactct gtgatagact 4620 
tcagccttga ccctaccttt accattgaga 4680 
ggactcaacg ccggggcagg actggcaggg 4740 
cgggggagcg cccctccggc atgttcgact 4 800 
gctgtgcttg gtatgagctc acgcccgccg 4 860 
acaccccggg gcttcccgtg tgccaggacc 4920 
gcctcactca tatagatgcc cactttttat 4980 
cttacctggt agcgtaccaa gccaccgtgt 504 0 
gggaccagat gtggaagtgt ttgatccgcc 5100 
tgctatacag actgggcgct gttcagaatg 5160 
acatcatgac atgcatgtcg gccgacctgg 5220 
gcggcgtcct ggctgctctg gccgcgtatt 5280 
gcaggatcgt cttgtccggg aagccggcaa 5340 
agttcgatga gatggaagag tgctctcagc 54 00 
tcgctgagca gttcaagcag aaggccctcg 5460 
aggttatcac ccctgctgtc cagaccaact 5520 
acatgtggaa tttcatcagt gggatacaat 5580 
accccgccat tgcttcattg atggctttta 5640 
gccaaaccct cctcttcaac atattggggg 5700 
gtgccgctac tgcctttgtg ggtgctggcc 5760 
tggggaaggt cctcgtggac attcttgcag 5820 
tagcattcaa gatcatgagc ggtgaggtcc 5880 
ccgccatcct ctcgcctgga gcccttgtag 5940 
ggcacgttgg cccgggcgag ggggcagtgc 6000 
cccgggggaa ccatgtttcc cccacgcact 6060 
tcactgccat actcagcagc ctcactgtaa 6120 
taagctcgga gtgtaccact ccatgctccg 6180 
tatgcgaggt gctgagcgac tttaagacct 6240 
ctgggattcc ctttgtgtcc tgccagcgcg 6300 
ttatgcacac tcgctgccac tgtggagctg 63 60 
tgaggatcgt cggtcctagg acctgcagga 6420 
cctacaccac gggcccctgt actccccttc 6480 
gggtgtctgc agaggaatac gtggagataa 6540 
gtatgactac tgacaatctt aaatgcccgt 6600 
aattggacgg ggtgcgccta cacaggtttg 6660 
aggtatcatt cagagtagga ctccacgagt 6720 
ccgaaccgga cgtagccgtg ttgacgtcca 6780 
aggcggccgg gagaaggttg gcgagagggt 6840 
gccagctgtc cgctccatct ctcaaggcaa 6900 
ccgagctcat agaggctaac ctcctgtgga 6960 
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ggcaggagat gggcggcaac atcaccaggg ttgagtcaga gaacaaagtg gtgattctgg 7020 
actccttcga tccgcttgtg gcagaggagg atgagcggga ggtctccgta cctgcagaaa 7080 
ttctgcggaa gtctcggaga ttcgcccggg ccctgcccgt ctgggcgcgg ccggactaca 7140 
accccccgct agtagagacg tggaaaaagc ctgactacga accacctgtg gtccatggct 7200 
gcccgctacc acctccacgg tcccctcctg tgcctccgcc tcggaaaaag cgtacggtgg 7260 
tcctcaccga atcaacccta tctactgcct tggccgagct tgccaccaaa agttttggca 7320 
gctcctcaac ttccggcatt acgggcgaca atacgacaac atcctctgag cccgcccctt 7380 
ctggctgccc ccccgactcc gacgttgagt cctattcttc catgcccccc ctggaggggg 7440 
agcctgggga tccggatctc agcgacgggt catggtcgac ggtcagtagt ggggccgaca 7500 
cggaagatgt cgtgtgctgc tcaatgtctt attcctggac aggcgcactc gtcaccccgt 7560 
gcgctgcgga agaacaaaaa ctgcccatca acgcactgag caactcgttg ctacgccatc 7620 
acaatctggt gtattccacc acttcacgca gtgcttgcca aaggcagaag aaagtcacat 7680 
ttgacagact gcaagttctg gacagccatt accaggacgt gctcaaggag gtcaaagcag 774 0 
cggcgtcaaa agtgaaggct aacttgctat ccgtagagga agcttgcagc ctgacgcccc 7800 
cacattcagc caaatccaag tttggctatg gggcaaaaga cgtccgttgc catgccagaa 7860 
aggccgtagc ccacatcaac tccgtgtgga aagaccttct ggaagacagt gtaacaccaa 7920 
tagacactac catcatggcc aagaacgagg ttttctgcgt tcagcctgag aaggggggtc 7980 
gtaagccagc tcgtctcatc gtgttccccg acctgggcgt gcgcgtgtgc gagaagatgg 8040 
ccctgtacga cgtggttagc aagctccccc tggccgtgat gggaagctcc tacggattcc 8100 
aatactcacc aggacagcgg gttgaattcc tcgtgcaagc gtggaagtcc aagaagaccc 8160 
cgatggggtt ctcgtatgat acccgctgtt ttgactccac agtcactgag agcgacatcc 8220 
gtacggagga ggcaatttac caatgttgtg acctggaccc ccaagcccgc gtggccatca 8280 
agtccctcac tgagaggctt tatgttgggg gccctcttac caattcaagg ggggaaaact 8340 
gcggctaccg caggtgccgc gcgagcggcg tactgacaac tagctgtggt aacaccctca 8400 
cttgctacat caaggcccgg gcagcctgtc gagccgcagg gctccaggac tgcaccatgc 8460 
tcgtgtgtgg cgacgactta gtcgttatct gtgaaagtgc gggggtccag gaggacgcgg 852 0 
cgagcctgag agccttcacg gaggctatga ccaggtactc cgcccccccc ggggaccccc 8580 
cacaaccaga atacgacttg gagcttataa catcatgctc ctccaacgtg tcagtcgccc 8640 
acgacggcgc tggaaagagg gtctactacc ttacccgtga ccctacaacc cccctcgcga 8700 
gagccgcgtg ggagacagca agacacactc cagtcaattc ctggctaggc aacataatca 8760 
tgtttgcccc cacactgtgg gcgaggatga tactgatgac ccatttcttt agcgtcctca 8820 
tagccaggga tcagcttgaa caggctctta actgtgagat ctacggagcc tgctactcca 8880 
tagaaccact ggatctacct ccaatcattc aaagactcca tggcctcagc gcattttcac 8940 
tccacagtta ctctccaggt gaaatcaata gggtggccgc atgcctcaga aaacttgggg 9000 
tcccgccctt gcgagcttgg agacaccggg cccggagcgt ccgcgctagg cttctgtcca 9060 
gaggaggcag ggctgccata tgtggcaagt acctcttcaa ctgggcagta agaacaaagc 9120 
tcaaactcac tccaatagcg gccgctggcc ggctggactt gtccggttgg ttcacggctg 9180 
gctacagcgg gggagacatt tatcacagcg tgtctcatgc ccggccccgc tggttctggt 9240 
tttgcctact cctgctcgct gcaggggtag gcatctacct cctccccaac cgatgaaggt 9300 
tggggtaaac actccggcct cttaagccat ttcctgtttt tttttttttt tttttttttt 9360 
ttttcttttt ttttttcttt cctttccttc tttttttcct ttctttttcc cttctttaat 9420 
ggtggctcca tcttagccct agtcacggct agctgtgaaa ggtccgtgag ccgcatgact 9480 
gcagagagtg ctgatactgg cctctctgca gatcatgt 95 i 8 

<210> 2 
<211> 2984 
<212> PRT 
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<213> Hepatitis C virus 
<400> 2 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
1 5 10 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 
20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 

He Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gin Pro Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp 
85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 
100 105 no 

Arg Arg Arg Ser Arg Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys 
115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Ala Pro Leu 
130 135 140 

Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He 
165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr 
180 185 190 

Gin Val Arg Asn Ser Ser Gly Leu Tyr His Val Thr Asn Asp Cys Pro 
195 200 205 

Asn Ser Ser He Val Tyr Glu Ala Ala Asp Ala He Leu His Thr Pro 
210 215 220 

Gly Cys Val Pro Cys Val Arg Glu Gly Asn Ala Ser Arg Cys Trp Val 
225 230 235 240 
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Ala Val Thr Pro Thr Val Ala Thr Arg Asp Gly Lys Leu Pro Thr Thr 
245 250 255 

Gin Leu Arg Arg His He Asp Leu Leu Val Gly Ser Ala Thr Leu Cys 
260 265 270 

Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Gly 
275 280 285 

Gin Leu Phe Thr Phe Ser Pro Arg Arg His Trp Thr Thr Gin Asp Cys 
290 295 300 

Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg Met Ala Trp 
305 310 315 320 

Asp Met Met Met Asn Trp Ser Pro Thr Ala Ala Leu Val Val Ala Gin 
325 330 335 

Leu Leu Arg He Pro Gin Ala He Met Asp Met He Ala Gly Ala His 
340 345 350 

Trp Gly Val Leu Ala Gly He Ala Tyr Phe Ser Met Val Gly Asn Trp 
355 360 365 

Ala Lys Val Leu Val Val Leu Leu Leu Phe Ala Gly Val Asp Ala He 
370 375 380 

Gin Leu He Asn Thr Asn Gly Ser Trp His He Asn Ser Thr Ala Leu 
385 390 395 400 

Asn Cys Asn Glu Ser Leu Asn Thr Gly Trp Leu Ala Gly Leu Phe Tyr 
405 410 415 

Gin His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys 
420 425 430 

Arg Arg Leu Thr Asp Phe Ala Gin Gly Trp Gly Pro He Ser Tyr Ala 
435 440 445 

Asn Gly Ser Gly Leu Asp Glu Arg Pro Tyr Cys Trp His Tyr Pro Pro 
450 455 460 

Arg Pro Cys Gly He Val Pro Ala Lys Ser Val Cys Gly Pro Val Tyr 
465 470 475 480 



Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Ser Gly 
485 490 495 
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Ala Pro Thr Tyr Ser Trp Gly Ala Asn Asp Thr Asp Val Phe Val Leu 
500 505 510 

Asn Asn Thr Arg Pro Pro Leu Gly Asn Trp Phe Gly Cys Thr Trp Met 
515 520 525 

Asn Ser Thr Gly Phe Thr Lys Val Cys Gly Ala Pro Pro Cys Val He 
530 535 540 

Gly Gly Val Gly Asn Asn Thr Leu Leu Cys Pro Thr Asp Cys Phe Arg 
545 550 555 560 

Lys His Pro Glu Ala Thr Tyr Ser Arg Cys Gly Ser Gly Pro Trp He 
565 570 575 

Thr Pro Arg Cys Met Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro 
580 585 590 

Cys Thr lie Asn Tyr Thr He Phe Lys Val Arg Met Tyr Val Gly Gly 
595 600 605 

Val Glu His Arg Leu Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg 
610 615 620 

Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu 
625 630 635 640 

Ser Thr Thr Gin Trp Gin Val Leu Pro Cys Ser Phe Thr Thr Leu Pro 
645 650 655 

Ala Leu Ser Thr Gly Leu He His Leu His Gin Asn He Val Asp Val 
660 665 670 

Gin Tyr Leu Tyr Gly Val Gly Ser Ser He Ala Ser Trp Ala He Lys 
675 680 685 

Trp Glu Tyr Val Val Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val 
690 695 700 

Cys Ser Cys Leu Trp Met Met Leu Leu He Ser Gin Ala Glu Ala Ala 
705 710 715 720 

Leu Glu Asn Leu Val lie Leu Asn Ala Ala Ser Leu Ala Gly Thr His 
725 730 735 

Gly Leu Val Ser Phe Leu Val Phe Phe Cys Phe Ala Trp Tyr Leu Lys 
740 745 75Q 
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Gly Arg Trp Val Pro Gly Ala Val Tyr Ala Leu Tyr Gly Met Trp Pro 
755 760 765 

Leu Leu Leu Leu Leu Leu Ala Leu Pro Gin Arg Ala Tyr Ala Leu Asp 
770 775 7 8 o 

Thr Glu Val Ala Ala Ser Cys Gly Gly Val Val Leu Val Gly Leu Met 
785 7 *0 795 800 

Ala Leu Thr Leu Ser Pro Tyr Tyr Lys Arg Tyr lie Ser Trp Cys Met 
805 810 " sis 

Trp Trp Leu Gin Tyr Phe Leu Thr Arg Val Glu Ala Gin Leu His Val 
820 8 25 

Trp Val Pro Pro Leu Asn Val Arg Gly Gly Arg Asp Ala Val He Leu 
835 840 845 

Leu Met Cys Val Val His Pro Thr Leu Val Phe Asp lie Thr Lys Leu 
850 855 860 

Leu Leu Ala lie Phe Gly Pro Leu Trp He Leu Gin Ala Ser Leu Leu 
865 8 70 875 880 

Lys Val Pro Tyr Phe Val Arg Val Gin Gly Leu Leu Arg He Cys Ala 
885 ego 895 

Leu Ala Arg Lys He Ala Gly Gly His Tyr Val Gin Met Ala He He 
900 905 910 

Lys Leu Gly Ala Leu Thr Gly Thr Tyr Val Tyr Asn His Leu Thr Pro 
915 920 925 

Leu Arg Asp Trp Ala His Asn Gly Leu Arg Asp Leu Ala Val Ala Val 
930 935 940 

Glu Pro Val Val Phe Ser Arg Met Glu Thr Lys Leu He Thr Trp Gly 
945 9 *0 955 960 

Ala Asp Thr Ala Ala Cys Gly Asp He He Asn Gly Leu Pro Val Ser 
9fi 5 970 975 

Ala Arg Arg Gly Gin Glu lie Leu Leu Gly Pro Ala Asp Gly Met Val 
9 80 985 990 

Ser Lys Gly Trp Arg Leu Leu Ala Pro lie Thr Ala Tyr Ala Gin Gin 
995 1000 1005 
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Thr^Arg Gly Leu Leu Gly Cys He lie Thr Ser Leu Thr Gly Arg Asp 
.t.w 1015 1020 

Lys Asn Gin Val Glu Gly Glu Val Gin He Val Ser Thr Ala Thr Gin 
" 1030 1035 1040 

Thr Phe Leu Ala Thr Cys lie Asn Gly Val Cys Trp Thr Val Tyr His 
1°4S 1050 



1055 



Gly Ala Gly Thr Arg Thr lie Ala Ser Pro Lys Gly Pro Val I 



106 ° 1065 



le Gin 



1070 



Met Tyr Thr Asn Val Asp Gin Asp Leu Val Gly Trp Pro Ala Pro Gin 
1075 10 80 1085 

G1V 1 ^n ^ S " ^ ^ ^ CyS 01 * Ser Ser Asp Leu Tyr 

1090 1Q 95 1100 

Leu Val Thr Arg His Ala Asp Val lie Pro Val Arg Arg Arg Gly Asp 
1110 "IS 1120 



Ser Arg Gly Ser Leu Leu Ser Pro Arg Pro He Ser Tyr Leu Lys Gly 

1125 I" 0 1135 

Ser ser Gly Gly Pro Leu Leu Cys Pro Ala Gly His Ala Val Gly Leu 
1140 H« 1150 

Phe Arg Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala Val Asp Phe 
1155 use 1165 

Ile il70 ^ Ar9 Ser Pro V *l ^e Thr 

1175 1180 

Asp^Asn ser Ser Pro Pro Ala Val Pro Gin Ser Phe Gin val Ala His 
1190 11" 1200 

Leu His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Ala Ala 
1205 121° 1215 

Tyr Ala Ala Gin Gly Tyr Lvs Val t-„ w = 1 , ^ „ 

1 Leu Asn Pro Ser v al Ala 
1220 I 2 " 1230 

Ala Thr^u Gly Phe Gly Ala Tyr Met Ser Lys Ala His Gly Val Asp 

1240 1245 

Pro Asn He Arg Thr Gly Val Arg Thr He Thr Thr Gly Ser Pro He 



1250 1255 

ldit>b 1260 
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Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly 
1265 1270 1275 1280 

Gly Ala Tyr Asp He He He Cys Asp Glu Cys His Ser Thr Asp Ala 
1285 1290 1295 

Thr Ser He Leu Gly He Gly Thr Val Leu Asp Gin Ala Glu Thr Ala 
1300 1305 1310 

Gly Ala Arg Leu Val Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val 
1315 1320 1325 

Thr Val Ser His Pro Asn He Glu Glu Val Ala Leu Ser Thr Thr Gly 
1330 1335 1340 

Glu He Pro Phe Tyr Gly Lys Ala He Pro Leu Glu Val He Lys Gly 
1345 1350 1355 1360 

Gly Arg His Leu lie Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu 
1365 1370 1375 

Ala Ala Lys Leu Val Ala Leu Gly He Asn Ala Val Ala Tyr Tyr Arg 
1380 1385 1390 

Gly Leu Asp Val Ser Val He Pro Thr Ser Gly Asp Val Val Val Val 
1395 1400 1405 

Ser Thr Asp Ala Leu Met Thr Gly Phe Thr Gly Asp Phe Asp Ser Val 
1410 1415 1420 

He Asp Cys Asn Thr Cys Val Thr Gin Thr Val Asp Phe Ser Leu Asp 
1425 1430 1435 1440 

Pro Thr Phe Thr He Glu Thr Thr Thr Leu Pro Gin Asp Ala Val Ser 
1445 1450 1455 

Arg Thr Gin Arg Arg Gly Arg Thr Gly Arg Gly Lys Pro Gly He Tyr 
1460 1465 1470 

Arg Phe Val Ala Pro Gly Glu Arg Pro Ser Gly Met Phe Asp Ser Ser 
1475 1480 1485 

Val Leu Cys Glu Cys Tyr Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr 
1490 1495 15Q0 

Pro Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met Asn Thr Pro Gly 
1505 1510 1515 1520 
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Leu Pro Val Cys Gin Asp His Leu Glu Phe Trp Glu Gly Val Phe Thr 
1525 1530 1535 

Gly Leu Thr His He Asp Ala His Phe Leu Ser Gin Thr Lys Gin Ser 
1540 1545 1550 

Gly Glu Asn Phe Pro Tyr Leu Val Ala Tyr Gin Ala Thr Val Cys Ala 
1555 1560 1565 

Arg Ala Gin Ala Pro Pro Pro Ser Trp Asp Gin Met Trp Lys Cys Leu 
1570 1575 1580 

He Arg Leu Lys Pro Thr Leu His Gly Pro Thr Pro Leu Leu Tyr Arg 
1585 1590 1595 1600 

Leu Gly Ala Val Gin Asn Glu Val Thr Leu Thr His Pro He Thr Lys 
1605 1610 1615 

Tyr He Met Thr Cys Met Ser Ala Asp Leu Glu Val Val Thr Ser Thr 
1620 1625 1630 

Trp Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu 
1635 1640 1645 

Ser Thr Gly Cys Val Val He Val Gly Arg He Val Leu Ser Gly Lys 
1650 1655 1660 

Pro Ala He He Pro Asp Arg Glu Val Leu Tyr Gin Glu Phe Asp Glu 
1665 1670 1675 1680 

Met Glu Glu Cys Ser Gin His Leu Pro Tyr He Glu Gin Gly Met Met 
1685 1690 1695 

Leu Ala Glu Gin Phe Lys Gin Lys Ala Leu Gly Leu Leu Gin Thr Ala 
1700 1705 i7io 

Ser Arg His Ala Glu Val He Thr Pro Ala Val Gin Thr Asn Trp Gin 
1715 1720 1725 

Lys Leu Glu Val Phe Trp Ala Lys His Met Trp Asn Phe He Ser Gly 
1730 1735 1740 

He Gin Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala He 
1745 17 50 1755 1760 

Ala Ser Leu Met Ala Phe Thr Ala Ala Val Thr Ser Pro Leu Thr Thr 
1765 1770 1775 
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Gly Gin Thr Leu Leu Phe Asn He Leu Gly Gly Trp Val Ala Ala Gin 
1780 1785 1790 

Leu Ala Ala Pro Gly Ala Ala Thr Ala Phe Val Gly Ala Gly Leu Ala 
1795 1800 1805 

Gly Ala Ala He Gly Ser Val Gly Leu Gly Lys Val Leu Val Asp He 
1810 1815 1820 

Leu Ala Gly Tyr Gly Ala Gly Val Ala Gly Ala Leu Val Ala Phe Lys 
1825 1830 1835 1840 

lie Met Ser Gly Glu Val Pro Ser Thr Glu Asp Leu Val Asn Leu Leu 
1845 1850 1855 

Pro Ala He Leu Ser Pro Gly Ala Leu Val Val Gly Val Val Cys Ala 
1860 1865 1870 

Ala He Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gin Trp 
1875 1880 1885 

Met Asn Arg Leu He Ala Phe Ala Ser Arg Gly Asn His Val Ser Pro 
1850 1895 1900 

Thr His Tyr Val Pro Glu Ser Asp Ala Ala Ala Arg Val Thr Ala lie 
1905 1910 1915 1920 

Leu Ser Ser Leu Thr Val Thr Gin Leu Leu Arg Arg Leu His Gin Trp 
1925 1930 i 9 35 

He Ser Ser Glu Cys Thr Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp 
1940 1945 1950 

He Trp Asp Trp He Cys Glu Val Leu Ser Asp Phe Lys Thr Trp Leu 
1955 i960 1965 

Lys Ala Lys Leu Met Pro Gin Leu Pro Gly He Pro Phe Val Ser Cys 
1970 1975 1980 

Gin Arg Gly Tyr Arg Gly Val Trp Arg Gly Asp Gly He Met His Thr 
1985 1990 1995 2000 

Arg Cys His Cys Gly Ala Glu He Thr Gly His Val Lys Asn Gly Thr 
2005 2010 2015 

Met Arg He Val Gly Pro Arg Thr Cys Arg Asn Met, Trp Ser Gly Thr 
2 °20 2025 2030 
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Phe Pro lie Asn Ala Tyr Thr Thr Gly Pro Cys Thr Pro Leu Pro Ala 
2035 2040 2045 

Pro Asn Tyr Lys Phe Ala Leu Trp Arg Val Ser Ala Glu Glu Tyr Val 
2050 2055 2060 

Glu He Arg Arg Val Gly Asp Phe His Tyr Val Ser Gly Met Thr Thr 
2065 2070 2075 2080 

Asp Asn Leu Lys Cys Pro Cys Gin He Pro Ser Pro Glu Phe Phe Thr 
2085 2090 2095 

Glu Leu Asp Gly Val Arg Leu His Arg Phe Ala Pro Pro Cys Lys Pro 
2100 2105 2110 

Leu Leu Arg Glu Glu Val Ser Phe Arg Val Gly Leu His Glu Tyr Pro 
2H5 2120 2125 

Val Gly Ser Gin Leu Pro Cys Glu Pro Glu Pro Asp Val Ala Val Leu 
2130 2135 2140 

Thr Ser Met Leu Thr Asp Pro Ser His He Thr Ala Glu Ala Ala Gly 
2145 2150 2155 2160 

Arg Arg Leu Ala Arg Gly Ser Pro Pro Ser Met Ala Ser Ser Ser Ala 
2165 2170 2175 

Ser Gin Leu Ser Ala Pro Ser Leu Lys Ala Thr Cys Thr Ala Asn His 
21 80 2185 2190 

Asp Ser Pro Asp Ala Glu Leu He Glu Ala Asn Leu Leu Trp Arg Gin 
21 95 2200 2205 

Glu Met Gly Gly Asn He Thr Arg Val Glu Ser Glu Asn Lys Val Val 
2210 2215 2220 

He Leu Asp Ser Phe Asp Pro Leu Val Ala Glu Glu Asp Glu Arg Glu 
2225 2 "0 2235 2240 

Val Ser Val Pro Ala Glu He Leu Arg Lys Ser Arg Arg Phe Ala Arg 
2245 2250 2255 

Ala Leu Pro Val Trp Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu 
2260 2265 2270 

Thr Trp Lys Lys Pro Asp Tyr Glu Pro Pro Val Val His Gly Cys Pro 
2275 2280 2 285 
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Leu Pro Pro Pro Arg Ser Pro Pro Val Pro Pro Pro Arg Lys Lys Arg 
2290 2295 2300 

Thr Val Val Leu Thr Glu Ser Thr Leu Ser Thr Ala Leu Ala Glu Leu 
2305 2310 2315 2320 

Ala Thr Lys Ser Phe Gly Ser Ser Ser Thr Ser Gly He Thr Gly Asp 
2325 2330 2335 

Asn Thr Thr Thr Ser Ser Glu Pro Ala Pro Ser Gly Cys Pro Pro Asp 
2340 2345 2350 

Ser Asp Val Glu Ser Tyr Ser Ser Met Pro Pro Leu Glu Gly Glu Pro 
2355 2360 2365 

Gly Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr Val Ser Ser Gly 
2370 2375 2380 

Ala Asp Thr Glu Asp Val Val Cys Cys Ser Met Ser Tyr Ser Trp Thr 
2385 2390 2395 2400 

Gly Ala Leu Val Thr Pro Cys Ala Ala Glu Glu Gin Lys Leu Pro He 
2405 2410 2415 

Asn Ala Leu Ser Asn Ser Leu Leu Arg His His Asn Leu Val Tyr Ser 
2420 2425 2430 

Thr Thr Ser Arg Ser Ala Cys Gin Arg Gin Lys Lys Val Thr Phe Asp 
2435 2440 2445 

Arg Leu Gin Val Leu Asp Ser His Tyr Gin Asp Val Leu Lys Glu Val 
2450 2455 2460 

Lys Ala Ala Ala Ser Lys Val Lys Ala Asn Leu Leu Ser Val Glu Glu 
2465 2470 2475 2480 

Ala Cys Ser Leu Thr Pro Pro His Ser Ala Lys Ser Lys Phe Gly Tyr 
2485 2490 2495 

Gly Ala Lys Asp Val Arg Cys His Ala Arg Lys Ala Val Ala His He 
2500 2505 2510 

Asn Ser Val Trp Lys Asp Leu Leu Glu Asp Ser Val Thr Pro He Asp 
2515 2520 2525 

Thr Thr He Met Ala Lys Asn Glu Val Phe Cys Val Gin Pro Glu Lys 
2530 2535 2540 
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Gly Gly Arg Lys Pro Ala Arg Leu He Val Phe Pro Asp Leu Gly Val 
2545 2550 2555 2560 

Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val Ser Lys Leu Pro 
2565 2570 2575 

Leu Ala Val Met Gly Ser Ser Tyr Gly Phe Gin Tyr Ser Pro Gly Gin 
2580 2585 2590 

Arg Val Glu Phe Leu Val Gin Ala Trp Lys Ser Lys Lys Thr Pro Met 
2595 2600 2605 

Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Ser 
2610 2615 2620 

Asp He Arg Thr Glu Glu Ala He Tyr Gin Cys Cys Asp Leu Asp Pro 
2625 2630 2635 2640 

Gin Ala Arg Val Ala He Lys Ser Leu Thr Glu Arg Leu Tyr Val Gly 
2645 2650 2655 

Gly Pro Leu Thr Asn Ser Arg Gly Glu Asn Cys Gly Tyr Arg Arg Cys 
2660 2665 2670 

Arg Ala Ser Gly Val Leu Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys 
2675 2680 2685 

Tyr He Lys Ala Arg Ala Ala Cys Arg Ala Ala Gly Leu Gin Asp Cys 
2690 2695 2700 

Thr Met Leu Val Cys Gly Asp Asp Leu Val Val He Cys Glu Ser Ala 
2705 2710 2715 2720 

Gly Val Gin Glu Asp Ala Ala Ser Leu Arg Ala Phe Thr Glu Ala Met 
2725 2730 2735 

Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gin Pro Glu Tyr Asp 
2740 2745 2750 

Leu Glu Leu He Thr Ser Cys Ser Ser Asn Val Ser Val Ala His Asp 
2755 2760 2765 

Gly Ala Gly Lys Arg Val Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro 
2770 2775 2780 

Leu Ala Arg Ala Ala Trp Glu Thr Ala Arg His Thr Pro Val Asn Ser 
2785 2790 2795 2800 
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Trp Leu Gly Asn He He Met Phe Ala Pro Thr Leu Trp Ala Arg Met 
2805 2810 2815 

He Leu Met Thr His Phe Phe Ser Val Leu He Ala Arg Asp Gin Leu 
2820 2825 2830 

Glu Gin Ala Leu Asn Cys Glu He Tyr Gly Ala Cys Tyr Ser He Glu 
2835 2840 2845 

Pro Leu Asp Leu Pro Pro He He Gin Arg Leu His Gly Leu Ser Ala 
2850 2855 2860 

Phe Ser Leu His Ser Tyr Ser Pro Gly Glu He Asn Arg Val Ala Ala 
2865 2870 2875 2880 

Cys Leu Arg Lys Leu Gly Val Pro Pro Leu Arg Ala Trp Arg His Arg 
2885 2890 2895 

Ala Arg Ser Val Arg Ala Arg Leu Leu Ser Arg Gly Gly Arg Ala Ala 
2900 2905 2910 

He Cys Gly Lys Tyr Leu Phe Asn Trp Ala Val Arg Thr Lys Leu Lys 
2915 2920 2925 

Leu Thr Pro He Ala Ala Ala Gly Arg Leu Asp Leu Ser Gly Trp Phe 
2930 2935 2940 

Thr Ala Gly Tyr Ser Gly Gly Asp lie Tyr His Ser Val Ser His Ala 
2945 2950 2955 2960 

Arg Pro Arg Trp Phe Trp Phe Cys Leu Leu Leu Leu Ala Ala Gly Val 
2965 2970 2975 

Gly lie Tyr Leu Leu Pro Asn Arg 
2980 



<210> 3 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 

<400> 3 

acgcgtctgc agcttaatgg cccaggacgc gatgcttg 
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<210> 4 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 

<400> 4 

acgcgtagat cttaccaagt gcgcaattcc tcgggg 

<210> S 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 

<400> 5 

tcagttggat agcgtcgacg ccggcaaata gcag 

<210> 6 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 

<400> 6 

cgtcgacgcg atccaactga tcaacaccaa cggc 

<210> 7 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
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Primer 
<400> 7 

cttgtaccat caattacacc atattc 

<210> 8 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 

<400> 8 

gatagtgcca atgcctatac ggg 

<210> 9 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 

<400> 9 

cgtataggca ttggcactat ccttgtacca tcaattacac c 

<210> 10 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 

<400> 10 

ggtgtaattg atggtacaag gatagtgcca atgcctatac g 

<210> 11 
<211> 41 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 

<400> 11 

ttttttttgc ggccgcatgg tggggaactg ggcgaaggtc c 

<210> 12 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 

<400> 12 

acgcgtaagc ttctattact cggacctgtc cctgtcttcc ag 
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